Nucleic acid molecules encoding proteins essential for plant growth and development and uses thereof

ABSTRACT

Nucleotide sequences are isolated from Arabidopsis thaliana that code for proteins essential for plant growth and development. The essentiality of the proteins may be exploited by recombinantly expressing the proteins and using them in screening assays to identify compounds that interact with or inhibit the proteins and are therefore potential herbicides.

[0001] This application claims the benefit of U.S. Provisional Application No. 60/423,519 filed Nov. 4, 2002, which is incorporated herein by reference.

FIELD OF THE INVENTION

[0002] The present invention pertains to nucleic acid molecules isolated from Arabidopsis thaliana comprising nucleotide sequences that encode proteins essential for plant growth and development. The invention particularly relates to methods of using these proteins as herbicide targets, based on this essentiality.

BACKGROUND OF THE INVENTION

[0003] The use of herbicides to control undesirable vegetation such as weeds in crop fields has become almost a universal practice. The herbicide market exceeds 15 billion dollars annually. Despite this extensive use, weed control remains a significant and costly problem for farmers.

[0004] Effective use of herbicides requires sound management. For instance, the time and method of application and stage of weed plant development are critical to achieving good weed control with herbicides. Because various weed species are resistant to herbicides, the production of effective new herbicides becomes increasingly important. New herbicides can now be discovered using high-throughput screens that implement recombinant DNA technology. Metabolic enzymes found to be essential to plant growth and development can be recombinantly produced through standard molecular biological techniques and utilized as herbicide targets in screens for novel inhibitors of the enzyme activity. More generally, any essential plant protein can be used to screen for inhibitors of its activity. The novel inhibitors discovered through such screens may then be used as herbicides to control undesirable vegetation.

[0005] In view of the above, there remain persistent and ongoing problems with unwanted or detrimental vegetation growth (e.g. weeds). Furthermore, as the population continues to grow, there will be increasing food shortages. Therefore, there exists a long-felt yet unfulfilled need to find new, effective, and economical herbicides.

SUMMARY OF THE INVENTION

[0006] In view of these needs, it is an object of the invention to provide nucleic acid molecules from Arabidopsis thaliana comprising nucleotide sequences that encode proteins essential for plant growth and development. It is another object to provide the essential proteins encoded by these essential nucleotide sequences for assay development to identify inhibitory compounds with herbicidal activity. It is still another object of the present invention to provide an effective and beneficial method for identifying new or improved herbicides using the essential proteins of the invention.

[0007] In furtherance of these and other objects, the present invention provides nucleic acid molecules isolated from Arabidopsis thaliana comprising nucleotide sequences that encode proteins essential for plant viability. Genetic results show that when any of the nucleotide sequences of the invention are mutated in Arabidopsis thaliana, the resulting phenotype is embryo or seedling lethal in the homozygous state. In particular, by using Ac/Ds transposon or T-DNA-mediated mutagenesis, the inventors of the present invention are the first to demonstrate that the activity of each protein of the present invention is essential for plant growth in Arabidopsis thaliana.

[0008] This knowledge is exploited to provide novel herbicide modes of action. The critical role in plant growth of the proteins encoded by each of the nucleotide sequences of the invention implies that chemicals that inhibit the function of any one of these proteins in plants are likely to have detrimental effects on plants and are potentially good herbicide candidates. Thus, the proteins encoded by the essential nucleotide sequences provide the bases for assays designed to easily and rapidly identify novel herbicides.

[0009] The present invention therefore provides methods of using a purified protein encoded by any one of the nucleotide sequences described below to identify inhibitors thereof, which can then be used as herbicides to suppress the growth of undesirable vegetation, e.g. in fields where crops are grown, particularly agronomically important crops such as maize and other cereal crops such as wheat, oats, rye, sorghum, rice, barley, millet, turf and forage grasses, and the like, as well as cotton, sugar cane, sugar beet, oilseed rape, and soybeans.

[0010] Disclosed herein are nucleic acid molecules isolated from Arabidopsis thaliana. In one embodiment, the present invention provides an isolated nucleic acid molecule comprising a nucleotide sequence, the complement of which hybridizes under stringent conditions to a sequence selected from the group consisting of the odd numbered SEQ ID NOs: 1-47. In another embodiment, the present invention provides an isolated nucleic acid molecule comprising a nucleotide sequence that encodes a protein comprising an amino acid sequence having at least 60%, preferably 70%, more preferably 80%, still more preferably 90%, even more preferably 95%, and most preferably 99-100% sequence identity to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.

[0011] The present invention also provides a chimeric construct comprising a promoter operatively linked to a nucleic acid molecule according to the present invention, wherein the promoter is preferably functional in a eukaryote, wherein the promoter is preferably heterologous to the nucleic acid molecule. The present invention further provides a recombinant vector comprising a chimeric construct according to the present invention, wherein said vector is capable of being stably transformed into a host cell. The present invention still further provides a host cell comprising a nucleic acid molecule according to the present invention, wherein said nucleic acid molecule is preferably expressible in the cell. The host cell is preferably selected from the group consisting of a plant cell, a yeast cell, an insect cell, and a prokaryotic cell. The present invention additionally provides a plant or seed comprising a plant cell according to the present invention.

[0012] The present invention also provides proteins essential for plant growth in Arabidopsis thaliana. In one embodiment, the present invention provides an isolated protein comprising an amino acid sequence having at least 60%, preferably 70%, more preferably 80%, still more preferably 90%, even more preferably 95%, and most preferably 99-100% sequence identity to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. In accordance with another embodiment, the present invention also relates to the recombinant production of proteins of the invention and methods of using the proteins of the invention in assays for identifying compounds that interact with the protein.

[0013] According to another aspect, the present invention provides a method of identifying a herbicidal compound, comprising: (a) combining a polypeptide comprising an amino acid sequence at least 90% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48 with a compound to be tested for the ability to bind to said polypeptide, under conditions conducive to binding; (b) selecting a compound identified in (a) that binds to said polypeptide; (c) applying a compound selected in (b) to a plant to test for herbicidal activity; and (d) selecting a compound identified in (c) that has herbicidal activity. Preferably, the polypeptide comprises an amino acid sequence at least 95% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. More preferably, the polypeptide comprises an amino acid sequence at least 99% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. Most preferably, the polypeptide comprises an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. The present invention also provides a method for killing or inhibiting the growth or viability of a plant, comprising applying to the plant a herbicidal compound identified according to this method.

[0014] According to yet another aspect, the present invention provides a method of identifying a herbicidal compound, comprising: (a) combining a polypeptide comprising an amino acid sequence at least 90% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48 with a compound to be tested for the ability to inhibit the activity of said polypeptide, under conditions conducive to inhibition; (b) selecting a compound identified in (a) that inhibits the activity of said polypeptide; (c) applying a compound selected in (b) to a plant to test for herbicidal activity; and (d) selecting a compound identified in (c) that has herbicidal activity. Preferably, the polypeptide comprises an amino acid sequence at least 95% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. More preferably, the polypeptide comprises an amino acid sequence at least 99% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. Most preferably, the polypeptide comprises an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48. The present invention also provides a method for killing or inhibiting the growth or viability of a plant, comprising applying to the plant a herbicidal compound identified according to this method.
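Steps (a) through (d) of the two identification methods above can be viewed as a two-stage filter: an in vitro stage (binding or inhibition) followed by an in planta stage. The following is a minimal sketch of that flow only. The function name `screen_compounds` and the two predicate callables are hypothetical placeholders for real assays and are not part of the disclosure.

```python
from typing import Callable, Iterable, List

def screen_compounds(
    compounds: Iterable[str],
    passes_in_vitro: Callable[[str], bool],  # hypothetical: binding or inhibition assay result
    is_herbicidal: Callable[[str], bool],    # hypothetical: whole-plant test result
) -> List[str]:
    """Return compounds that pass the in vitro assay and then the plant test."""
    # Steps (a)-(b): combine each compound with the polypeptide and keep the hits.
    in_vitro_hits = [c for c in compounds if passes_in_vitro(c)]
    # Steps (c)-(d): apply only those hits to a plant and keep the herbicidal ones.
    return [c for c in in_vitro_hits if is_herbicidal(c)]
```

Only compounds that survive the first stage are ever applied to a plant, which reflects the ordering of the recited steps.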

[0015] The present invention still further provides a method for killing or inhibiting the growth or viability of a plant, comprising inhibiting expression in said plant of a protein having at least 60%, preferably 70%, more preferably 80%, still more preferably 90%, even more preferably 95%, and most preferably 99-100% sequence identity to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.

[0016] Other objects and advantages of the present invention will become apparent to those skilled in the art and from a study of the following description of the invention and non-limiting examples. The entire contents of all publications mentioned herein are hereby incorporated by reference.

BRIEF DESCRIPTION OF THE SEQUENCES IN THE SEQUENCE LISTING

[0017] Odd numbered SEQ ID NOs: 1-47 are nucleotide sequences isolated from Arabidopsis thaliana that are more fully described in Table 5 below.

[0018] Even numbered SEQ ID NOs:2-48 are protein sequences encoded by the immediately preceding nucleotide sequence, e.g., SEQ ID NO:2 is the protein encoded by the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO:4 is the protein encoded by the nucleotide sequence of SEQ ID NO:3, etc.
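The pairing convention just described (each odd-numbered nucleotide sequence is followed by the protein it encodes) can be written out as a small lookup. This is an illustrative sketch only; the function name `protein_seq_id` is hypothetical.

```python
def protein_seq_id(nucleotide_seq_id: int) -> int:
    """Map an odd-numbered nucleotide SEQ ID NO (1-47) to its protein SEQ ID NO."""
    if nucleotide_seq_id % 2 == 0 or not 1 <= nucleotide_seq_id <= 47:
        raise ValueError("expected an odd SEQ ID NO between 1 and 47")
    # The encoded protein is listed immediately after its nucleotide sequence.
    return nucleotide_seq_id + 1

# All nucleotide/protein pairs covered by SEQ ID NOs: 1-48.
SEQ_ID_PAIRS = {n: protein_seq_id(n) for n in range(1, 48, 2)}
```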

[0019] SEQ ID NOs:49-73 are PCR primers.

DEFINITIONS

[0020] For clarity, certain terms used in the specification are defined and presented as follows:

[0021] “Associated with/operatively linked” refers to two nucleic acid sequences that are related physically or functionally. For example, a promoter or regulatory DNA sequence is said to be “associated with” a DNA sequence that codes for an RNA or a protein if the two sequences are operatively linked, or situated such that the regulatory DNA sequence will affect the expression level of the coding or structural DNA sequence.

[0022] A “chimeric construct” is a recombinant nucleic acid sequence in which a promoter or regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid sequence that codes for an mRNA or which is expressed as a protein, such that the regulatory nucleic acid sequence is able to regulate transcription or expression of the associated nucleic acid sequence. The regulatory nucleic acid sequence of the chimeric construct is not normally operatively linked to the associated nucleic acid sequence as found in nature.

[0023] Co-factor: natural reactant, such as an organic molecule or a metal ion, required in an enzyme-catalyzed reaction. A co-factor is e.g. NAD(P), riboflavin (including FAD and FMN), folate, molybdopterin, thiamin, biotin, lipoic acid, pantothenic acid and coenzyme A, S-adenosylmethionine, pyridoxal phosphate, ubiquinone, menaquinone. Optionally, a co-factor can be regenerated and reused.

[0024] A “coding sequence” is a nucleic acid sequence that is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA. Preferably the RNA is then translated in an organism to produce a protein.

[0025] Complementary: “complementary” refers to two nucleotide sequences that comprise antiparallel nucleotide sequences capable of pairing with one another upon formation of hydrogen bonds between the complementary base residues in the antiparallel nucleotide sequences.
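As an illustration of antiparallel pairing, the sketch below computes the reverse complement of a DNA strand and tests whether two strands are fully complementary. It assumes unambiguous DNA bases (A, C, G, T) and perfect Watson-Crick pairing; the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the antiparallel complementary strand, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]

def are_complementary(strand_a: str, strand_b: str) -> bool:
    """True if strand_b pairs base-for-base with strand_a in antiparallel orientation."""
    return reverse_complement(strand_a) == strand_b.upper()
```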

[0026] Enzyme activity: means herein the ability of an enzyme to catalyze the conversion of a substrate into a product. A substrate for the enzyme comprises the natural substrate of the enzyme but also comprises analogues of the natural substrate, which can also be converted by the enzyme into a product or into an analogue of a product. The activity of the enzyme is measured for example by determining the amount of product in the reaction after a certain period of time, or by determining the amount of substrate remaining in the reaction mixture after a certain period of time. The activity of the enzyme is also measured by determining the amount of an unused co-factor of the reaction remaining in the reaction mixture after a certain period of time or by determining the amount of used co-factor in the reaction mixture after a certain period of time. The activity of the enzyme is also measured by determining the amount of a donor of free energy or energy-rich molecule (e.g. ATP, phosphoenolpyruvate, acetyl phosphate or phosphocreatine) remaining in the reaction mixture after a certain period of time or by determining the amount of a used donor of free energy or energy-rich molecule (e.g. ADP, pyruvate, acetate or creatine) in the reaction mixture after a certain period of time.
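The two most direct measurements described above, product formed and substrate remaining, lead to the same rate when the reaction has one-to-one stoichiometry. The sketch below assumes such stoichiometry and an average rate over the whole incubation; the units (micromoles, minutes) and function names are illustrative assumptions.

```python
def activity_from_product(product_umol: float, minutes: float) -> float:
    """Average activity (umol/min) from the amount of product after a set time."""
    return product_umol / minutes

def activity_from_substrate(initial_umol: float, remaining_umol: float, minutes: float) -> float:
    """Average activity (umol/min) from the substrate consumed after a set time."""
    # Substrate consumed equals product formed under the assumed 1:1 stoichiometry.
    return (initial_umol - remaining_umol) / minutes
```

The same arithmetic applies to co-factor or energy-donor measurements, with the measured species substituted for the substrate.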

[0027] Essential: an “essential” Arabidopsis thaliana nucleotide sequence is a nucleotide sequence encoding a protein such as e.g. a biosynthetic enzyme, receptor, signal transduction protein, structural gene product, or transport protein that is essential to the growth or survival of the plant.

[0028] Expression Cassette: “Expression cassette” as used herein means a nucleic acid molecule capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operatively linked to the nucleotide sequence of interest which is operatively linked to termination signals. It also typically comprises sequences required for proper translation of the nucleotide sequence. The coding region usually codes for a protein of interest but may also code for a functional RNA of interest, for example antisense RNA or a nontranslated RNA, in the sense or antisense direction. The expression cassette comprising the nucleotide sequence of interest may be chimeric, meaning that at least one of its components is heterologous with respect to at least one of its other components. The expression cassette may also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. Typically, however, the expression cassette is heterologous with respect to the host, i.e., the particular DNA sequence of the expression cassette does not occur naturally in the host cell and must have been introduced into the host cell or an ancestor of the host cell by a transformation event. The expression of the nucleotide sequence in the expression cassette may be under the control of a constitutive promoter or of an inducible promoter that initiates transcription only when the host cell is exposed to some particular external stimulus. In the case of a multicellular organism, such as a plant, the promoter can also be specific to a particular tissue or organ or stage of development.

[0029] Gene: the term “gene” is used broadly to refer to any segment of DNA associated with a biological function. Thus, genes include coding sequences and/or the regulatory sequences required for their expression. Genes also include nonexpressed DNA segments that, for example, form recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information, and may include sequences designed to have desired parameters.

[0030] Heterologous/exogenous: The terms “heterologous” and “exogenous” when used herein to refer to a nucleic acid sequence (e.g. a DNA sequence) or a gene, refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling. The terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are expressed to yield exogenous polypeptides.

[0031] A “homologous” nucleic acid (e.g. DNA) sequence is a nucleic acid (e.g. DNA) sequence naturally associated with a host cell into which it is introduced.

[0032] Hybridization: The phrase “hybridizing specifically to” refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture of DNA or RNA (e.g., total cellular DNA or RNA). “Bind(s) substantially” refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence.

[0033] Inhibitor: a chemical substance that inactivates the enzymatic activity of a protein such as a biosynthetic enzyme, receptor, signal transduction protein, structural gene product, or transport protein. The term “herbicide” (or “herbicidal compound”) is used herein to define an inhibitor applied to a plant at any stage of development, whereby the herbicide inhibits the growth of the plant or kills the plant.

[0034] Interaction: quality or state of mutual action such that the effectiveness or toxicity of one protein or compound on another protein is inhibitory (antagonists) or enhancing (agonists).

[0035] A nucleic acid sequence is “isocoding with” a reference nucleic acid sequence when the nucleic acid sequence encodes a polypeptide having the same amino acid sequence as the polypeptide encoded by the reference nucleic acid sequence.
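Whether two sequences are isocoding can be checked by translating both and comparing the resulting amino acid sequences. The sketch below assumes the standard genetic code, DNA input in reading frame starting at the first base, and no ambiguous bases; the names `translate` and `is_isocoding` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop codon).
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)}

def translate(dna: str) -> str:
    """Translate in-frame DNA into a one-letter amino acid string."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def is_isocoding(seq: str, reference: str) -> bool:
    """True if both sequences encode the same amino acid sequence."""
    return translate(seq) == translate(reference)
```

For example, `ATGGCT` and `ATGGCC` differ at one nucleotide but both encode Met-Ala, so they are isocoding.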

[0036] Isogenic: plants that are genetically identical, except that they may differ by the presence or absence of a heterologous DNA sequence.

[0037] Isolated: in the context of the present invention, an isolated DNA molecule or an isolated enzyme is a DNA molecule or enzyme that, by the hand of man, exists apart from its native environment and is therefore not a product of nature. An isolated DNA molecule or enzyme may exist in a purified form or may exist in a non-native environment such as, for example, in a transgenic host cell.

[0038] Mature protein: protein from which the transit peptide, signal peptide, and/or propeptide portions have been removed.

[0039] Minimal Promoter: the smallest piece of a promoter, such as a TATA element, that can support any transcription. A minimal promoter typically has greatly reduced promoter activity in the absence of upstream activation. In the presence of a suitable transcription factor, the minimal promoter functions to permit transcription.

[0040] Modified Enzyme Activity: enzyme activity different from that which naturally occurs in a plant (i.e. enzyme activity that occurs naturally in the absence of direct or indirect manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally occurring enzyme activity.

[0041] Native: refers to a gene that is present in the genome of an untransformed plant cell.

[0042] Naturally occurring: the term “naturally occurring” is used to describe an object that can be found in nature as distinct from being artificially produced by man. For example, a protein or nucleotide sequence present in an organism (including a virus), which can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory, is naturally occurring.

[0043] Nucleic acid: the term “nucleic acid” refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g. degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acids Res. 19: 5081 (1991); Ohtsuka et al., J. Biol. Chem. 260: 2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8: 91-98 (1994)). The terms “nucleic acid” or “nucleic acid sequence” may also be used interchangeably with gene, cDNA, and mRNA encoded by a gene.

[0044] “ORF” means open reading frame.

[0045] Percent identity: the phrases “percent identity” or “percent identical,” in the context of two nucleic acid or protein sequences, refer to two or more sequences or subsequences that have for example 60%, preferably 70%, more preferably 80%, still more preferably 90%, even more preferably 95%, and most preferably at least 99% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. Preferably, the percent identity exists over a region of the sequences that is at least about 50 residues in length, more preferably over a region of at least about 100 residues, and most preferably the percent identity exists over at least about 150 residues. In an especially preferred embodiment, the percent identity exists over the entire length of the coding regions.
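Once two sequences have been aligned, percent identity reduces to counting identical columns. The sketch below assumes the sequences are already aligned to equal length with "-" as the gap character, and it divides by the full alignment length; other denominators are also in use, so this convention is an assumption. Function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and aligned to equal length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)

def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             min_percent: float = 60.0, min_length: int = 50) -> bool:
    """Apply an identity cutoff over a region of at least min_length columns."""
    return len(aligned_a) >= min_length and percent_identity(aligned_a, aligned_b) >= min_percent
```

The defaults mirror the broadest values recited above (60% identity over about 50 residues); stricter tiers can be tested by passing other values.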

[0046] For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.

[0047] Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by visual inspection (see generally, Ausubel et al., infra).
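For orientation, a global alignment in the style of Needleman & Wunsch can be scored with a short dynamic program. The sketch below is a simplified illustration, not the GAP or BESTFIT implementation: it returns only the optimal score, uses a linear gap penalty, and its scoring values (match +1, mismatch -1, gap -1) are assumptions chosen for readability.

```python
def needleman_wunsch_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -1) -> int:
    """Optimal global alignment score of a and b under a linear gap penalty."""
    # prev[j] holds the best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        cur = [i * gap]
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Best of: align the two residues, gap in b, or gap in a.
            cur.append(max(diagonal, prev[j] + gap, cur[j - 1] + gap))
        prev = cur
    return prev[-1]
```

A local (Smith & Waterman style) variant would additionally floor every cell at zero and report the maximum cell rather than the final one.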

[0048] One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 215: 403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., 1990). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction is halted when the cumulative alignment score falls off by the quantity X from its maximum achieved value, the cumulative score goes to zero or below due to the accumulation of one or more negative-scoring residue alignments, or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=−4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89: 10915 (1992)).
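The extension step described above can be illustrated for nucleotide sequences with a short sketch. It covers only the rightward, ungapped extension of a single seed and applies the three stopping rules (drop of X from the maximum, score at or below zero, end of either sequence). It is a simplification and not the NCBI implementation: the seed is assumed to be an exact match, M=5 and N=-4 are taken from the defaults recited above, and the value X=20 and the function name `extend_right` are assumptions.

```python
def extend_right(query: str, subject: str, q_start: int, s_start: int,
                 word_len: int, reward: int = 5, penalty: int = -4,
                 x_drop: int = 20) -> tuple:
    """Extend a seed word hit to the right; return (best_score, best_length)."""
    score = best = reward * word_len      # seed assumed to match exactly
    length = best_len = word_len
    i, j = q_start + word_len, s_start + word_len
    while i < len(query) and j < len(subject):   # stop at the end of either sequence
        score += reward if query[i] == subject[j] else penalty
        length += 1
        if score > best:
            best, best_len = score, length
        elif best - score >= x_drop or score <= 0:
            break                          # fell off by X, or dropped to zero or below
        i += 1
        j += 1
    return best, best_len
```

A smaller X ends the extension at the first mismatch cluster and runs faster, while a larger X lets the extension continue past mismatches and recover a longer HSP, which is the sensitivity and speed trade-off noted above.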

[0049] In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90: 5873-5877 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a test nucleic acid sequence is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid sequence to the reference nucleic acid sequence is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
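The three cutoffs recited above can be expressed as a simple tiering of a reported P(N) value. This sketch treats the "about" values as exact thresholds, which is a simplifying assumption; the function name and tier labels are hypothetical.

```python
def similarity_tier(p_n: float) -> str:
    """Classify a smallest sum probability P(N) against the recited cutoffs."""
    if p_n < 0.001:
        return "most preferred"
    if p_n < 0.01:
        return "more preferred"
    if p_n < 0.1:
        return "similar"
    return "not similar"
```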

[0050] Pre-protein: protein that is normally targeted to a cellular organelle, such as a chloroplast, and still comprises its native transit peptide.

[0051] Purified: the term “purified,” when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state. It is preferably in a homogeneous state although it can be in either a dry or aqueous solution. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified. The term “purified” denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least about 50% pure, more preferably at least about 85% pure, and most preferably at least about 99% pure.

[0052] Two nucleic acids are “recombined” when sequences from each of the two nucleic acids are combined in a progeny nucleic acid. Two sequences are “directly” recombined when both of the nucleic acids are substrates for recombination. Two sequences are “indirectly recombined” when the sequences are recombined using an intermediate such as a cross-over oligonucleotide. For indirect recombination, no more than one of the sequences is an actual substrate for recombination, and in some cases, neither sequence is a substrate for recombination.

[0053] “Regulatory elements” refer to sequences involved in controlling the expression of a nucleotide sequence. Regulatory elements comprise a promoter operatively linked to the nucleotide sequence of interest and termination signals. They also typically encompass sequences required for proper translation of the nucleotide sequence.

[0054] Significant Increase: an increase in enzymatic activity that is larger than the margin of error inherent in the measurement technique, preferably an increase by about 2-fold or greater of the activity of the wild-type enzyme in the presence of the inhibitor, more preferably an increase by about 5-fold or greater, and most preferably an increase by about 10-fold or greater.

[0055] Significantly less: means that the amount of a product of an enzymatic reaction is reduced by more than the margin of error inherent in the measurement technique, preferably a decrease by about 2-fold or greater of the activity of the wild-type enzyme in the absence of the inhibitor, more preferably a decrease by about 5-fold or greater, and most preferably a decrease by about 10-fold or greater.
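The comparison in this definition can be made concrete as follows. The sketch assumes that activity with and without the inhibitor is measured in the same units and that the margin of error is supplied as an absolute value in those units; the function names are hypothetical.

```python
def fold_decrease(uninhibited: float, inhibited: float) -> float:
    """Ratio of activity without the inhibitor to activity with the inhibitor."""
    return uninhibited / inhibited

def is_significantly_less(uninhibited: float, inhibited: float,
                          margin_of_error: float) -> bool:
    """True if the reduction exceeds the measurement technique's margin of error."""
    return (uninhibited - inhibited) > margin_of_error
```

For instance, activities of 10.0 and 2.0 units give a 5-fold decrease, which falls in the "more preferably" tier.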

[0056] Specific Binding/Immunological Cross-Reactivity: An indication that two nucleic acid sequences or proteins are substantially identical is that the protein encoded by the first nucleic acid is immunologically cross reactive with, or specifically binds to, the protein encoded by the second nucleic acid. Thus, a protein is typically substantially identical to a second protein, for example, where the two proteins differ only by conservative substitutions. The phrase “specifically (or selectively) binds to an antibody,” or “specifically (or selectively) immunoreactive with,” when referring to a protein or peptide, refers to a binding reaction which is determinative of the presence of the protein in the presence of a heterogeneous population of proteins and other biologics. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. For example, antibodies raised to the protein with the amino acid sequence encoded by any of the nucleic acid sequences of the invention can be selected to obtain antibodies specifically immunoreactive with that protein and not with other proteins except for polymorphic variants. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein. For example, solid-phase ELISA immunoassays, Western blots, or immunohistochemistry are routinely used to select monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York (“Harlow and Lane”), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity. Typically a specific or selective reaction will be at least twice background signal or noise and more typically more than 10 to 100 times background.

[0057] “Stringent hybridization conditions” and “stringent hybridization wash conditions” in the context of nucleic acid hybridization experiments such as Southern and Northern hybridizations are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes part I chapter 2 “Overview of principles of hybridization and the strategy of nucleic acid probe assays” Elsevier, N.Y. Generally, highly stringent hybridization and wash conditions are selected to be about 5° C. lower than the thermal melting point (T_(m)) for the specific sequence at a defined ionic strength and pH. Typically, under “stringent conditions” a probe will hybridize to its target subsequence, but to no other sequences.

[0058] The T_(m) is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the T_(m) for a particular probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42° C., with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.15M NaCl at 72° C. for about 15 minutes. An example of stringent wash conditions is a 0.2×SSC wash at 65° C. for 15 minutes (see, Sambrook, infra, for a description of SSC buffer). Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is 1×SSC at 45° C. for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6×SSC at 40° C. for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30° C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2×(or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the proteins that they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.

[0059] The following are examples of sets of hybridization/wash conditions that may be used to clone nucleotide sequences that are homologues of reference nucleotide sequences of the present invention: a homologous nucleotide sequence preferably hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO₄, 1 mM EDTA at 50° C. with washing in 2×SSC, 0.1% SDS at 50° C., more desirably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO₄, 1 mM EDTA at 50° C. with washing in 1×SSC, 0.1% SDS at 50° C., more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO₄, 1 mM EDTA at 50° C. with washing in 0.5×SSC, 0.1% SDS at 50° C., preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO₄, 1 mM EDTA at 50° C. with washing in 0.×SSC, 0.1% SDS at 50° C., more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO₄, 1 mM EDTA at 50° C. with washing in 0.×SSC, 0.1% SDS at 65° C.

[0060] A “subsequence” refers to a sequence of nucleic acids or amino acids that comprise a part of a longer sequence of nucleic acids or amino acids (e.g., protein) respectively.

[0061] Substrate: a substrate is the molecule that an enzyme naturally recognizes and converts to a product in the biochemical pathway in which the enzyme naturally carries out its function, or is a modified version of the molecule, which is also recognized by the enzyme and is converted by the enzyme to a product in an enzymatic reaction similar to the naturally-occurring reaction.

[0062] Transformation: a process for introducing heterologous DNA into a plant cell, plant tissue, or plant. Transformed plant cells, plant tissue, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof.

[0063] “Transformed,” “transgenic,” and “recombinant” refer to a host organism such as a bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic acid molecule can be stably integrated into the genome of the host or the nucleic acid molecule can also be present as an extrachromosomal molecule. Such an extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof. A “non-transformed,” “non-transgenic,” or “non-recombinant” host refers to a wild-type organism, e.g., a bacterium or plant, which does not contain the heterologous nucleic acid molecule.

[0064] Viability: “viability” as used herein refers to a fitness parameter of a plant. Plants are assayed for their homozygous performance of plant development, indicating which proteins are essential for plant growth.

DETAILED DESCRIPTION OF THE INVENTION

[0065] I. Identification of Essential Arabidopsis thaliana Nucleotide Sequences and Encoded Proteins Using Ac/Ds Transposon or T-DNA-Mediated Mutagenesis

[0066] As shown in the examples below, the essentiality of the nucleotide sequences described herein for normal plant growth and development has been demonstrated for the first time in Arabidopsis using Ac/Ds transposon or T-DNA-mediated mutagenesis. Having established the essentiality of the function of the encoded proteins in Arabidopsis thaliana and having identified the nucleotide sequences encoding these essential proteins, the inventors thereby provide an important and sought-after tool for new herbicide development.

[0067] Arabidopsis insertional mutant lines segregating for seedling lethal mutations are identified as a first step in the identification of essential proteins. Starting with T2 seeds collected from single T1 plants containing T-DNA insertions in their genomes, those lines segregating homozygous seedling lethal seedlings are identified. Ds transposon insertion lines are produced as described in Sundaresan et al. (1995) (Genes and Dev., 9:1797-1810), incorporated herein by reference. Starting with F3 or F4 seeds collected from single F2 or F3 kanamycin-resistant plants containing Ds insertions in their genomes (see FIG. 3 of Sundaresan et al. (1995) Genes and Dev., 9:1797-1810), those lines segregating homozygous seedling lethal seedlings are identified. These lines are found by placing seeds onto minimal plant growth media, which contains the fungicides benomyl and maxim, and screening for inviable seedlings after 7 and 14 days in the light at room temperature. Inviable phenotypes include altered pigmentation or altered morphology. These phenotypes are observed either on plates directly or in soil following transplantation of seedlings.

[0068] Essential genes are also identified through the isolation of lethal mutants blocked in early development. Examples of lethal mutants include those blocked in the formation of the male or female gametes or embryo. Gametophytic mutants are found by examining T1 insertion lines for the presence of 50% aborted pollen grains or ovules. Embryo defective mutants produce 25% defective seeds following self-pollination of T1 plants (see Errampalli et al. 1991, Plant Cell 3:149-157; Castle et al. 1993, Mol Gen Genet 241:504-514).

[0069] When a line is identified as segregating a seedling lethal or an embryo defective phenotype, it is determined if the resistance marker in the Ds transposon or T-DNA insertion co-segregates with the lethality (Errampalli et al. (1991) The Plant Cell, 3:149-157). Cosegregation analysis is done by placing the seeds on media containing the selective agent and scoring the seedlings for resistance or sensitivity to the agent. Examples of selective agents used are kanamycin, hygromycin, or phosphinothricin. About 35 resistant seedlings are transplanted to soil and their progeny are examined for the segregation of the seedling lethal. In the case in which the Ds transposon or T-DNA insertion disrupts an essential gene, there is co-segregation of the resistance phenotype and the seedling lethal or embryo defective phenotype in every plant. Therefore, in such a case, all resistant plants segregate a seedling lethal or embryo defective phenotype in the next generation; this result indicates that each of the resistant plants is heterozygous for the mutation and hemizygous for the T-DNA insert causing the mutation.

[0070] For the Arabidopsis lines showing co-segregation of the transposon-encoded or T-DNA-encoded resistance marker and the lethal phenotype, PCR-based molecular approaches such as TAIL-PCR (Liu et al. (1995) Plant J., 8:457-463; Liu and Whittier (1995), Genomics, 25:674-681), TAIL2k, vectorette PCR (Riley et al. (1990) Nucleic Acids Research, 18: 2887-2890), or the GenomeWalker™ kit (CLONTECH Laboratories, Inc., Palo Alto, Calif.), may be used to directly amplify the plant DNA fragments flanking the transposon or T-DNA. Each of these techniques utilizes the known sequence of the transposon or T-DNA, and can be used to recover small (less than 5 kb) fragments directly adjacent to the insertion. PCR products are isolated and their DNA sequence is determined.

[0071] Alternatively, plasmid rescue may be used to isolate the plant DNA/T-DNA border fragments. Southern blot analysis may be performed as an initial step in the characterization of the molecular nature of each insertion. Southern blots are done with genomic DNA isolated from heterozygotes and using probes capable of hybridizing with the T-DNA vector DNA. Using the results of the Southern analysis, appropriate restriction enzymes are chosen to perform plasmid rescue in order to molecularly clone Arabidopsis thaliana genomic DNA flanking one or both sides of the T-DNA insertion. Plasmids obtained in this manner are analyzed by restriction enzyme digestion to sort the plasmids into classes based on their digestion pattern. For each class of plasmid clone, the DNA sequence is determined.

[0072] The resulting sequences, obtained by any of the above outlined approaches, are analyzed for the presence of non-Ds transposon and non-T-DNA vector sequences, as appropriate. When such sequences are found, they are used to search DNA and protein databases using the BLAST and BLAST2 programs (Altschul et al. (1990) J Mol. Biol. 215: 403-410; Altschul et al. (1997) Nucleic Acid Res. 25:3389-3402, both incorporated herein by reference). Additional genomic and cDNA sequences for each gene are identified by standard molecular biology procedures.
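As an illustration of the screening step described above, the following minimal Python sketch trims a known insertion-end sequence from the start of a sequencing read and returns the remaining flank, which would then be used as the database query. The function name, the example sequences, and the minimum-match length are hypothetical and are not taken from this disclosure. Exact string matching is assumed; real border junctions may be truncated or rearranged and would generally call for an alignment tool.

```python
def plant_flank(read, insert_end, min_match=12):
    """Return the part of `read` that follows the known insertion end.

    Assumes the read starts inside the insertion and runs outward into
    plant DNA, so the read should begin with some suffix of `insert_end`.
    Returns None when no overlap of at least `min_match` bases is found.
    """
    read, insert_end = read.lower(), insert_end.lower()
    # Try the longest possible overlap first, then progressively shorter ones.
    for k in range(min(len(read), len(insert_end)), min_match - 1, -1):
        if read.startswith(insert_end[-k:]):
            return read[k:]
    return None


# Hypothetical sequences, for illustration only.
example_end = "ttgacaggatatattggcgggtaaac"
example_read = "gatatattggcgggtaaac" + "ctaagagaaaagagcgttta"
flank = plant_flank(example_read, example_end)
```

A read for which the function returns None would, under these assumptions, be set aside for manual inspection rather than discarded.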

[0073] II. Recombinant Production of Essential Proteins and Uses Thereof

[0074] For recombinant production of a protein of the invention in a host organism, a nucleotide sequence encoding the protein is inserted into an expression cassette designed for the chosen host and introduced into the host where it is recombinantly produced. The choice of the specific regulatory sequences such as promoter, signal sequence, 5′ and 3′ untranslated sequence, and enhancer appropriate for the chosen host is within the level of the skill of the routineer in the art. The resultant molecule, containing the individual elements linked in the proper reading frame, is inserted into a vector capable of being transformed into the host cell. Suitable expression vectors and methods for recombinant production of proteins are well known for host organisms such as E. coli, yeast, and insect cells (see, e.g., Luckow and Summers, Bio/Technol. 6:47 (1988)). Additional suitable expression vectors are baculovirus expression vectors, e.g., those derived from the genome of Autographa californica nuclear polyhedrosis virus (AcMNPV). A preferred baculovirus/insect system is PVL1392(3) used to transfect Spodoptera frugiperda SF9 cells (ATCC) in the presence of linear Autographa californica baculovirus DNA (Pharmingen, San Diego, Calif.). The resulting virus is used to infect HighFive Trichoplusia ni cells (Invitrogen, La Jolla, Calif.).

[0075] Recombinantly produced proteins are isolated and purified using a variety of standard techniques. The actual techniques used vary depending upon the host organism used, whether the protein is designed for secretion, and other such factors. Such techniques are well known to the skilled artisan (see, e.g., chapter 16 of Ausubel, F. et al., “Current Protocols in Molecular Biology”, pub. by John Wiley & Sons, Inc. (1994)).

[0076] III. Assays for Characterizing the Essential Proteins

[0077] The recombinantly produced proteins described herein are useful for a variety of purposes. For example, they can be used in in vitro assays to screen known herbicidal chemicals whose target has not been identified to determine if they inhibit protein activity. Such in vitro assays may also be used as more general screens to identify chemicals that inhibit such protein activity and that are therefore novel herbicide candidates. Recombinantly produced proteins may also be used to elucidate the complex structure of these molecules and to further characterize their association with known inhibitors in order to rationally design new inhibitory herbicides. Alternatively, the recombinant protein can be used to isolate antibodies or peptides that modulate the activity and are useful in transgenic solutions.

[0078] IV. In vitro Inhibitor Assay: Discovery of Small Molecule Ligands That Interact with Essential Proteins of Unknown Biochemical Function

[0079] Once a protein has been identified as a potential herbicide target based on its essentiality for normal plant growth and viability, a next step is to develop an assay that allows screening of a large number of chemicals to determine which ones interact with the protein. Although it is straightforward to develop assays for proteins of known function, developing assays with proteins of unknown functions can be more difficult.

[0080] To address this issue, novel technologies are used that can detect interactions between a protein and a compound without knowing the biological function of the protein. A short description of three methods is presented: fluorescence correlation spectroscopy, surface-enhanced laser desorption/ionization, and Biacore technologies.

[0081] Fluorescence Correlation Spectroscopy (FCS) theory was developed in 1972 but it is only in recent years that the technology to perform FCS became available (Magde et al. (1972) Phys. Rev. Lett., 29: 705-708; Maiti et al. (1997) Proc. Natl. Acad. Sci. USA, 94: 11753-11757). FCS measures the average diffusion rate of a fluorescent molecule within a small sample volume. The sample size can be as low as 10³ fluorescent molecules and the sample volume as low as the cytoplasm of a single bacterium. The diffusion rate is a function of the mass of the molecule and decreases as the mass increases. FCS can therefore be applied to protein-ligand interaction analysis by measuring the change in mass and therefore in diffusion rate of a molecule upon binding. In a typical experiment, the target to be analyzed is expressed as a recombinant protein with a sequence tag, such as a poly-histidine sequence, inserted at the N- or C-terminus. The expression takes place in E. coli, yeast or insect cells. The protein is purified by chromatography. For example, the poly-histidine tag can be used to bind the expressed protein to a metal chelate column such as Ni2+ chelated on iminodiacetic acid agarose. The protein is then labeled with a fluorescent tag such as carboxytetramethylrhodamine or BODIPY® (Molecular Probes, Eugene, Oreg.). The protein is then exposed in solution to the potential ligand, and its diffusion rate is determined by FCS using instrumentation available from Carl Zeiss, Inc. (Thornwood, N.Y.). Ligand binding is determined by changes in the diffusion rate of the protein.

[0082] Surface-Enhanced Laser Desorption/Ionization (SELDI) was invented by Hutchens and Yip during the late 1980's (Hutchens and Yip (1993) Rapid Commun. Mass Spectrom. 7: 576-580). When coupled to a time-of-flight mass spectrometer (TOF), SELDI provides a means to rapidly analyze molecules retained on a chip. It can be applied to ligand-protein interaction analysis by covalently binding the target protein on the chip and analyzing by MS the small molecules that bind to this protein (Worrall et al. (1998) Anal. Biochem. 70: 750-756). In a typical experiment, the target to be analyzed is expressed as described for FCS. The purified protein is then used in the assay without further preparation. It is bound to the SELDI chip either by utilizing the poly-histidine tag or by other interaction such as ion exchange or hydrophobic interaction. The chip thus prepared is then exposed to the potential ligand via, for example, a delivery system capable of pipetting the ligands in a sequential manner (autosampler). The chip is then submitted to washes of increasing stringency, for example a series of washes with buffer solutions of increasing ionic strength. After each wash, the bound material is analyzed by submitting the chip to SELDI-TOF. Ligands that specifically bind the target will be identified by the stringency of the wash needed to elute them.

[0083] Biacore relies on changes in the refractive index at the surface layer upon binding of a ligand to a protein immobilized on the layer. In this system, a collection of small ligands is injected sequentially in a 2-5 microlitre cell with the immobilized protein. Binding is detected by surface plasmon resonance (SPR) by recording laser light refracting from the surface. In general, the refractive index change for a given change of mass concentration at the surface layer is practically the same for all proteins and peptides, allowing a single method to be applicable for any protein (Liedberg et al. (1983) Sensors Actuators 4: 299-304; Malmquist (1993) Nature, 361: 186-187). In a typical experiment, the target to be analyzed is expressed as described for FCS. The purified protein is then used in the assay without further preparation. It is bound to the Biacore chip either by utilizing the poly-histidine tag or by other interaction such as ion exchange or hydrophobic interaction. The chip thus prepared is then exposed to the potential ligand via the delivery system incorporated in the instruments sold by Biacore (Uppsala, Sweden) to pipette the ligands in a sequential manner (autosampler). The SPR signal on the chip is recorded and changes in the refractive index indicate an interaction between the immobilized target and the ligand. Analysis of the signal kinetics (on rate and off rate) allows the discrimination between non-specific and specific interaction.
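The on-rate and off-rate analysis mentioned above can be sketched in Python as follows. The sketch assumes a simple 1:1 binding model, which is a common way to interpret SPR sensorgrams but is not specified in this disclosure; the function names, parameter names, and numerical values are hypothetical.

```python
import math


def dissociation_constant(k_on, k_off):
    """Equilibrium dissociation constant K_D = k_off / k_on (in M)."""
    return k_off / k_on


def association_response(t, conc, k_on, k_off, r_max):
    """SPR response at time t (s) while ligand at `conc` (M) flows over the chip.

    Under the assumed 1:1 model, dR/dt = k_on*conc*(r_max - R) - k_off*R,
    whose solution starting from R(0) = 0 is an exponential approach
    to the equilibrium response.
    """
    k_obs = k_on * conc + k_off
    r_eq = r_max * k_on * conc / k_obs
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation_response(t, r_start, k_off):
    """SPR response t seconds after the ligand is replaced by buffer."""
    return r_start * math.exp(-k_off * t)


# Hypothetical rate constants for a specific binder.
k_on, k_off = 1.0e5, 1.0e-3          # 1/(M*s) and 1/s
kd = dissociation_constant(k_on, k_off)
plateau = association_response(1.0e6, kd, k_on, k_off, r_max=100.0)
```

In this model a ligand injected at a concentration equal to K_D plateaus at half of the maximal response, which gives a quick consistency check on fitted constants.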

[0084] Another assay for small molecule ligands that interact with a polypeptide is an inhibitor assay. For example, such an inhibitor assay useful for identifying inhibitors of the products of essential plant nucleic acid sequences, such as the essential Arabidopsis proteins described herein, comprises the steps of:

[0085] a) reacting an essential Arabidopsis protein described herein and a substrate thereof in the presence of a suspected inhibitor of the protein's function;

[0086] b) comparing the rate of enzymatic activity of the protein in the presence of the suspected inhibitor to the rate of enzymatic activity under the same conditions in the absence of the suspected inhibitor; and

[0087] c) determining whether the suspected inhibitor inhibits the essential Arabidopsis protein.

[0088] For example, the inhibitory effect on the activity of an essential Arabidopsis protein described herein may be determined by a reduction or complete inhibition of protein activity in the assay. Such a determination may be made by comparing, in the presence and absence of the candidate inhibitor, the amount of substrate used or intermediate or product made during the reaction.
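The comparison in steps a) to c) can be sketched in Python as follows. The sketch expresses the result as a fold decrease and grades it against the about 2-fold, 5-fold, and 10-fold levels given in the definition of "significantly less" in paragraph [0055]. The function names and the example rates are hypothetical, and the margin of error of the measurement, which the definition also requires to be exceeded, is not modeled here.

```python
def fold_decrease(rate_without, rate_with):
    """Fold decrease in activity caused by the candidate inhibitor."""
    if rate_without <= 0:
        raise ValueError("uninhibited rate must be positive")
    if rate_with <= 0:
        return float("inf")  # complete inhibition
    return rate_without / rate_with


def grade_inhibition(rate_without, rate_with):
    """Grade a candidate against the 2-, 5-, and 10-fold levels of [0055]."""
    fold = fold_decrease(rate_without, rate_with)
    if fold >= 10:
        return "10-fold or greater"
    if fold >= 5:
        return "5-fold or greater"
    if fold >= 2:
        return "2-fold or greater"
    return "below 2-fold"


# Hypothetical product-formation rates (arbitrary units per minute).
strong = grade_inhibition(100.0, 8.0)
weak = grade_inhibition(100.0, 80.0)
```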

[0089] V. Production of Peptides

[0090] Phage particles displaying diverse peptide libraries permit rapid library construction, affinity selection, amplification and selection of ligands directed against an essential protein (H. B. Lowman, Annu. Rev. Biophys. Biomol. Struct. 26, 401-424 (1997)). Structural analysis of these selectants can provide new information about ligand-target molecule interactions and in the process also provide a novel molecule that can enable the development of new herbicides based upon these peptides as leads.

[0091] VI. In Vivo Inhibitor Assay

[0092] In one embodiment, a suspected herbicide, for example identified by in vitro screening, is applied to plants at various concentrations. The suspected herbicide is preferably sprayed on the plants. After application of the suspected herbicide, its effect on the plants, for example death or suppression of growth, is recorded.

[0093] In another embodiment, an in vivo screening assay for inhibitors of the activity of a hereindescribed essential protein uses transgenic plants, plant tissue, plant seeds or plant cells capable of overexpressing a nucleotide sequence disclosed herein that encodes an essential protein, wherein the essential protein is enzymatically active in the transgenic plants, plant tissue, plant seeds or plant cells. A chemical is then applied to the transgenic plants, plant tissue, plant seeds or plant cells and to the isogenic non-transgenic plants, plant tissue, plant seeds or plant cells, and the growth or viability of the transgenic and non-transformed plants, plant tissue, plant seeds or plant cells are determined after application of the chemical and compared. Compounds capable of inhibiting the growth of the non-transgenic plants, but not affecting the growth of the transgenic plants are selected as specific inhibitors of the essential protein's activity.
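The selection rule in this embodiment can be sketched in Python as follows. Growth is expressed relative to an untreated control of the same genotype, and the two cut-off values are hypothetical assumptions chosen only for illustration; the disclosure does not specify numerical thresholds. The compound names and measurements are likewise invented.

```python
def is_specific_inhibitor(non_transgenic_growth, transgenic_growth,
                          inhibited_below=0.5, unaffected_above=0.9):
    """True when a compound suppresses the non-transgenic plants but spares
    the isogenic plants overexpressing the essential protein.

    Both growth values are fractions of the untreated control (1.0 = no effect).
    """
    return (non_transgenic_growth < inhibited_below
            and transgenic_growth > unaffected_above)


# Hypothetical screening results: (non-transgenic, transgenic) relative growth.
results = {
    "compound_A": (0.10, 0.95),  # rescued by overexpression
    "compound_B": (0.15, 0.20),  # toxic to both lines
    "compound_C": (0.98, 1.00),  # inactive
}
selected = [name for name, (wt, tg) in results.items()
            if is_specific_inhibitor(wt, tg)]
```

A compound that inhibits both lines equally, such as compound_B in this invented data set, would not be attributed to the overexpressed target under this rule.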

[0094] The invention will be further described by reference to the following detailed examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified.

EXAMPLES

[0095] Standard recombinant DNA and molecular cloning techniques used here are well known in the art and are described by J. Sambrook, et al., Molecular Cloning: A Laboratory Manual, 3d Ed., Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press (2001); by T. J. Silhavy, M. L. Berman, and L. W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1984); by Ausubel, F. M. et al., Current Protocols in Molecular Biology, New York, John Wiley and Sons Inc., (1988); by Reiter, et al., Methods in Arabidopsis Research, World Scientific Press (1992); and by Schultz et al., Plant Molecular Biology Manual, Kluwer Academic Publishers (1998). These references describe the standard techniques used for all steps in tagging and cloning genes from Ac/Ds transposon or T-DNA mutagenized populations of Arabidopsis: plant infection and transformation; screening for the identification of seedling mutants; and cosegregation analysis. Ds transposon insertion lines produced as described in Sundaresan et al. (1995) (Genes and Dev., 9:1797-1810) are used in these experiments. T-DNA lines are generated using vacuum infiltration or floral dip methods (Bechtold et al. (1993) C. R. Acad. Sci. Paris, 316:1194-1199; Clough and Bent (1998) Plant J., 16:735-743; Desfeux et al. (2000) Plant Physiol., 123:895-904).

Example 1 Identification of Arabidopsis Mutants with Lethal Phenotypes

[0096] Essential genes are identified through the isolation of lethal mutants blocked in early development. Examples of lethal mutants include those blocked in the formation of the male or female gametes, embryo, or resulting seedling. Gametophytic mutants are found by examining insertion lines for the presence of 50% aborted pollen grains or ovules. Embryo defective lethal mutants usually produce 25% defective seeds following self-pollination of plants heterozygous for an insertion (see Errampalli et al. 1991, Plant Cell 3:149-157; Castle et al. 1993, Mol Gen Genet 241:504-514). Seedling lethal mutants usually segregate 25% seedlings that exhibit a lethal phenotype.
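Whether an observed count is consistent with the 25% expectation stated above can be checked with a chi-square goodness-of-fit test, sketched here in Python. The test itself is a standard statistical tool that this disclosure does not name, so its use is an assumption; the function names and the example counts are hypothetical. The critical value 3.841 corresponds to one degree of freedom at the 5% level.

```python
CHI2_CRITICAL_1DF_P05 = 3.841  # chi-square critical value, 1 d.f., p = 0.05


def chi_square_3_to_1(normal, mutant):
    """Chi-square statistic for a 3:1 (75% normal : 25% mutant) expectation."""
    total = normal + mutant
    expected_normal = 0.75 * total
    expected_mutant = 0.25 * total
    return ((normal - expected_normal) ** 2 / expected_normal
            + (mutant - expected_mutant) ** 2 / expected_mutant)


def fits_quarter_mutant(normal, mutant):
    """True when the counts do not differ significantly from 3:1."""
    return chi_square_3_to_1(normal, mutant) < CHI2_CRITICAL_1DF_P05


# Hypothetical seed counts from one self-pollinated plant.
consistent = fits_quarter_mutant(normal=148, mutant=52)
inconsistent = fits_quarter_mutant(normal=190, mutant=10)
```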

Example 2 Cosegregation Analysis for Lines with Lethal Phenotypes

[0097] The linkage of the mutation to the Ds or T-DNA insertion is established after identifying a transformed line segregating for a lethal phenotype of interest. A line segregating with a single functional insert will segregate for resistance in the ratio of about 2:1 (resistant: sensitive) to the selectable marker. In the case of an embryo defective mutant, one-quarter of the progeny of a plant heterozygous for an insertion will fail to germinate due to embryo lethality, resulting in a reduction of the normal 3:1 ratio to 2:1. In the case of a seedling lethal mutant, the seedlings with a mutant phenotype are excluded in the calculation of this ratio. Each of the resistant progeny is therefore heterozygous for the mutation if the Ds or T-DNA insertion is causing the mutant phenotype. To establish cosegregation of the insertion and the mutant phenotype, about 30 resistant progeny are transplanted to soil and each plant is shown to segregate the 25% progeny with a lethal phenotype by the appropriate screening of embryo or seedlings. When all resistant plants segregate the lethal phenotype, there is cosegregation of the insertion and the lethal mutation and the line is designated as “tagged.”
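The 2:1 ratio described above follows from simple Mendelian bookkeeping, sketched here in Python. The sketch assumes a single insertion (written I) carrying a dominant resistance marker and causing a recessive lethal mutation; all names are hypothetical. The last function adds an inference that is not stated in this disclosure: if the lethal mutation were completely unlinked to the insertion, each viable resistant plant would be heterozygous for it with probability 2/3, so the chance that every one of about 30 resistant plants segregates the lethal phenotype would be very small.

```python
from fractions import Fraction
from itertools import product


def selfed_classes():
    """Genotype frequencies among progeny of a selfed I/+ heterozygote."""
    freqs = {}
    for egg, pollen in product("I+", repeat=2):
        genotype = "".join(sorted(egg + pollen))  # "++", "+I", or "II"
        freqs[genotype] = freqs.get(genotype, 0) + Fraction(1, 4)
    return freqs


def resistant_to_sensitive():
    """Ratio of resistant to sensitive plants among the viable progeny.

    II progeny are lethal and drop out; +I are resistant; ++ are sensitive.
    """
    freqs = selfed_classes()
    return freqs["+I"] / freqs["++"]


def chance_all_segregate_if_unlinked(n_plants):
    """Probability that all n resistant plants carry an UNLINKED lethal mutation."""
    return Fraction(2, 3) ** n_plants


ratio = resistant_to_sensitive()
by_chance = float(chance_all_segregate_if_unlinked(30))
```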

Example 3 T-DNA Border Isolation by Plasmid Rescue

[0098] The plasmid rescue technique is used to molecularly clone Arabidopsis flanking DNA from one or both sides of the T-DNA insertion(s). Arabidopsis genomic DNA is isolated as described by Reiter et al. in Methods in Arabidopsis Research, World Scientific Press (1992). Genomic DNA is digested with a restriction endonuclease and ligated overnight. After ligation, the DNA is transformed into competent E. coli strain XL-1 Blue, DH10B, DH5 alpha, or the like, and colonies are selected on semi-solid medium containing ampicillin. Resistant colonies are picked into liquid medium with ampicillin and grown overnight. Plasmid DNA is isolated and digested with the rescue enzyme and analyzed on agarose gels containing ethidium bromide for visualization. Plasmids that represent different size classes are sequenced using primers that flank the plant DNA portion of the rescue element and the sequence is analyzed to determine what portion is plant DNA and what gene has been disrupted. The plasmid rescue is validated via PCR of template genomic DNA from a heterozygote for the insertion mutation. The experiment uses a primer anchored in the predicted flanking sequence and a primer in the T-DNA insertion. Finding a PCR product of the appropriate size, based on the sequence of the plasmid rescue clone confirms a valid rescue. Alternatively, Southern blot analysis with a probe that detects the relevant region of Arabidopsis DNA in genomic DNA from a heterozygote for the insertion mutation can be used to confirm the plasmid rescue results.

Example 4 Transposon or T-DNA Border Isolation by TAIL-PCR

[0099] Arabidopsis genomic DNA is isolated according to Reiter et al. in Methods in Arabidopsis Research, World Scientific Press (1992) or using the Nucleon PhytoPure™ Plant DNA isolation kit (Amersham International plc, Buckinghamshire, England) or the Puregene DNA isolation kit (Gentra Systems, Minneapolis, Minn.). Fragments of genomic DNA flanking the borders of the transposon or T-DNA are isolated using the TAIL-PCR technique (Liu et al. (1995) Plant J., 8:457-463; Liu and Whittier (1995), Genomics, 25:674-681). Three sets of 12 TAIL-PCR reactions, referred to as the primary, secondary and tertiary reactions, are performed. In each reaction, one arbitrary degenerate primer and one transposon-specific or T-DNA-specific primer are used. The arbitrary degenerate primer is chosen from among seven primers, LWAD1, CA50, CA51, CA52, CA53, CA54, and CA55 (Table 1), which are used to prime the genomic DNA flanking the insertion. Alternatively, fewer than 12 TAIL-PCR reactions are done using fewer arbitrary degenerate primers. These degenerate primers are used in combination with two sets of three, nested, transposon-specific primers (Table 2) or T-DNA-specific primers (Table 3). The transposon-specific primers are homologous to regions of the Ds elements that lie at the outermost ends of the transposons, DS5 at the 5′ end (primers 5A, 5B, and 5C) and DS3 at the 3′ end (primers 3A, 3B, and 3C). The T-DNA-specific primers are homologous to regions of the T-DNA that lie in the borders of the T-DNAs. For the pCSA104 and pDAP101 T-DNAs, right borders are recovered with CA66 (primary primer), CA67 (secondary primer), and CA68 (tertiary primer) and left borders are recovered with JM33 (tertiary primer), JM34 (secondary primer), and JM35 (primary primer). For the pCSA110 T-DNA, right borders are recovered with QRB1 (primary primer), QRB2 (secondary primer), and QRB3 (tertiary primer) and left borders are recovered with JM33 (tertiary primer), JM34 (secondary primer), and JM35 (primary primer). For the pPCVICEn4HPT (Hayashi et al. (1992), Science, 258:1350-1353) and pSKI015 (Weigel et al. (2000) Plant Physiol. 122:1003-1014) T-DNAs, left borders are recovered with SKI1 (primary primer), SKI2 (secondary primer), and SKI3 (tertiary primer). When the degenerate and nested primer pairs are used in a series of low and high-stringency PCR amplifications, as described in the TAIL-PCR protocol (Liu and Whittier (1995), Genomics, 25:674-681), DNA fragments are produced that correspond to the genomic DNA that is directly adjacent to the transposon or T-DNA insertion. The nucleic acid sequences of the PCR products from the tertiary TAIL-PCR reactions are then determined by standard molecular biology techniques. The resulting sequences are analyzed for the presence of non-Ds transposon or non-T-DNA vector sequence.

[0100] To confirm the integrity of the resultant products, PCR primers specific to the flanking genomic region are designed and used in conjunction with the tertiary nested primer in a PCR reaction, to confirm the transposon or T-DNA insertion point within the genomic DNA. Finding a PCR product of the appropriate size, based on the sequence of the TAIL-PCR clone, confirms a valid rescue.

TABLE 1
Arbitrary Degenerate Primers

SEQ ID NO:   Primer   Degen.   Primer Sequence
49           LWAD1    1026     ngt tgw gna twt sgw gnt
50           CA50     128      ngt cga swg ana wga a
51           CA51     128      tgw gna gsa nca sag a
52           CA52     128      agw gna gwa nca wag g
53           CA53     256      stt gnt ast nct ntg c
54           CA54     64       ntc gas twt sgw gtt
55           CA55     256      wgt gna gwa nca nag a
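The "Degen." column of Table 1 can be understood as the number of distinct sequences represented by each degenerate primer, that is, the product of the number of bases allowed at each position under the standard IUPAC ambiguity codes. The following Python sketch computes that product; the interpretation of the column and the function name are assumptions, since the disclosure does not define the column. Under this counting the CA50 to CA55 values in the table are reproduced, while LWAD1 evaluates to 1024 rather than the listed 1026, which may reflect a typographical slip in the table.

```python
# Number of bases each IUPAC nucleotide code can stand for.
IUPAC_OPTIONS = {
    "a": 1, "c": 1, "g": 1, "t": 1,
    "r": 2, "y": 2, "s": 2, "w": 2, "k": 2, "m": 2,
    "b": 3, "d": 3, "h": 3, "v": 3,
    "n": 4,
}


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    total = 1
    for base in primer.lower():
        if base.isspace():
            continue  # triplet spacing in the table is only for readability
        total *= IUPAC_OPTIONS[base]
    return total


ca50 = degeneracy("ngt cga swg ana wga a")
lwad1 = degeneracy("ngt tgw gna twt sgw gnt")
```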

[0101] TABLE 2
Nested Primers For Ds Lines

SEQ ID NO:   Primer   Primer Sequence
56           5A       actagctctaccgtttccgtttccgtttac
57           5B       ttacctcgggttcgaaatcgatcgggataa
58           5C       aaaatcggttatacgataacggtcggtacggga
59           3A       gggtcttgcggatctgaatatatgttttcatgtgtg
60           3B       taccgaagaaaaataccggttcccgtccgatttcgac
61           3C       ggatcgtatcggttttcgattaccgtatttatcc

[0102] TABLE 3
Nested Primers For T-DNA Lines

SEQ ID NO:   Primer   Primer Sequence
62           CA66     att agg cac ccc agg ctt tac act tta tg
63           CA67     gta tgt tgt gtg gaa ttg tga gcg gat aac
64           CA68     taa caa ttt cac aca gga aac agc tat gac
65           JM33     tag cat ctg aat ttc ata acc aat ctc gat aca c
66           JM34     gct tcc tat tat atc ttc cca aat tac caa tac a
67           JM35     gcc ttt tca gaa atg gat aaa tag cct tgc ttc c
68           QRB1     caa act agg ata aat tat cgc gcg cgg tgt ca
69           QRB2     ggt gtc atc tat gtt act aga tcg gga att ga
70           QRB3     cgc cat ggc ata tgc tag cat gca taa ttc
71           SKI1     aat tgg taa tta ctc ttt ctt ttc ctc cat att ga
72           SKI2     ata ttg acc atc ata ctc att gct gat cca t
73           SKI3     tga tcc atg tag att tcc cgg aca tga a

Example 5 Transposon or T-DNA Border Isolation by TAIL2k PCR

[0103] Arabidopsis genomic DNA is isolated according to Reiter et al. in Methods in Arabidopsis Research, World Scientific Press (1992) or using the Nucleon PhytoPure™ Plant DNA isolation kit (Amersham International plc, Buckinghamshire, England) or the Puregene DNA isolation kit (Gentra Systems, Minneapolis, Minn.). Fragments of genomic DNA flanking the borders of the transposon or T-DNA are isolated using the TAIL2k PCR technique. Two sets of 12 TAIL-PCR reactions, referred to as the primary and secondary reactions, are performed. In each reaction, one arbitrary degenerate primer and one transposon-specific or T-DNA-specific primer are used. The arbitrary degenerate primer is selected from among six primers: CA50, CA51, CA52, CA53, CA54, and CA55 (Table 1), which are used to prime the genomic DNA flanking the insertion. Alternatively, fewer than 12 TAIL-PCR reactions are done using fewer arbitrary degenerate primers. These degenerate primers are used in combination with two sets of two, nested, transposon-specific primers (Table 2) or T-DNA-specific primers (Table 3). The transposon-specific primers are homologous to regions of the Ds elements that lie at the outermost ends of the transposons, DS5 at the 5′ end (primers 5A, 5B, and 5C) and DS3 at the 3′ end (primers 3A, 3B, and 3C). The T-DNA-specific primers are homologous to regions of the T-DNA that lie in the borders of the T-DNAs. For the pCSA104 and pDAP101 T-DNAs, right borders are recovered with CA66 (primary primer), CA67 (secondary primer), and CA68 (sequencing primer) and left borders are recovered with JM33 (sequencing primer), JM34 (secondary primer), and JM35 (primary primer). Primers CA66, CA67, and CA68 are also known as RB1, RB2, and RB3, respectively. Primers JM35, JM34, and JM33 are also known as LB1, LB2, and LB3, respectively. 
For the pCSA110 T-DNA, right borders are recovered with QRB1 (primary primer), QRB2 (secondary primer), and QRB3 (sequencing primer) and left borders are recovered with JM33 (sequencing primer), JM34 (secondary primer), and JM35 (primary primer). For the pPCVICEn4HPT (Hayashi et al. (1992), Science, 258:1350-1353) and pSKI015 (Weigel et al. (2000) Plant Physiol. 122:1003-1014) T-DNAs, left borders are recovered with SKI1 (primary primer), SKI2 (secondary primer), and SKI3 (sequencing primer). When the degenerate and nested primer pairs are used in a series of low- and high-stringency PCR amplifications, as described in the TAIL-PCR protocol (Liu and Whittier (1995), Genomics, 25:674-681), DNA fragments are produced that correspond to the genomic DNA that is directly adjacent to the transposon or T-DNA insertion. TAIL2k-PCR differs from the original TAIL-PCR protocol by the elimination of the tertiary PCR and modification of the secondary PCR. The cycling conditions used in the secondary reaction are modified to include five high annealing temperature cycles (64 degrees C.) at the beginning, three additional so-called super cycles, and five additional low annealing temperature cycles (44 degrees C.) at the end of the reaction. The melting and extension times are the same as in all other TAIL-PCR reactions. Additionally, the reaction volume is increased to 40 microliters. The nucleic acid sequences of the PCR products from the secondary TAIL2k-PCR reactions are then determined by standard molecular biology techniques. The resulting sequences are analyzed for the presence of non-Ds transposon or non-T-DNA vector sequence.
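The primer assignments in this example can be laid out as a small lookup to enumerate the primary and secondary reactions for a given T-DNA vector. This is only a sketch: the names `BORDER_PRIMERS` and `plan_reactions` are hypothetical, and reading "12 reactions" as six arbitrary degenerate primers times two borders is an interpretation of the text, not something the text states explicitly.

```python
# Arbitrary degenerate primers named in Example 5 (Table 1).
AD_PRIMERS = ["CA50", "CA51", "CA52", "CA53", "CA54", "CA55"]

# (primary, secondary, sequencing) primers per border, as listed in Example 5.
BORDER_PRIMERS = {
    "pCSA104": {"RB": ("CA66", "CA67", "CA68"), "LB": ("JM35", "JM34", "JM33")},
    "pDAP101": {"RB": ("CA66", "CA67", "CA68"), "LB": ("JM35", "JM34", "JM33")},
    "pCSA110": {"RB": ("QRB1", "QRB2", "QRB3"), "LB": ("JM35", "JM34", "JM33")},
    "pPCVICEn4HPT": {"LB": ("SKI1", "SKI2", "SKI3")},
    "pSKI015": {"LB": ("SKI1", "SKI2", "SKI3")},
}


def plan_reactions(vector, ad_primers=AD_PRIMERS):
    """List (round, border, degenerate primer, specific primer) tuples."""
    plan = []
    for border, (primary, secondary, _sequencing) in BORDER_PRIMERS[vector].items():
        for ad_primer in ad_primers:
            plan.append(("primary", border, ad_primer, primary))
            plan.append(("secondary", border, ad_primer, secondary))
    return plan


for reaction in plan_reactions("pCSA110")[:4]:
    print(reaction)
```

Passing a shorter `ad_primers` list models the alternative in which fewer degenerate primers, and therefore fewer reactions, are used.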

[0104] To confirm the integrity of the resultant products, PCR primers specific to the flanking genomic region are designed and used in conjunction with the tertiary nested primer in a PCR reaction, to confirm the transposon or T-DNA insertion point within the genomic DNA. Finding a PCR product of the appropriate size, based on the TAIL2k-PCR sequencing result, confirms a valid rescue.

Example 6 Identification of Both Borders of a T-DNA or Ds Insertion

[0105] If the results of border rescue provide information on only one of the two borders for an insertion in a given line, additional experiments are performed to identify the second border. These experiments are necessary to show that a single gene has been disrupted in a given line. In some cases, an insertion can affect more than a single gene due to a chromosomal deletion or rearrangement. In those cases, additional experiments are required to identify which of the affected genes is responsible for the lethal phenotype.

[0106] When both borders of an insertion are not recovered, primers are designed to isolate a PCR product that will provide information on the location of the missing border. Three primers are chosen in Arabidopsis genomic DNA on the opposite side of the insertion, about one, two, and five kb away from the insertion point; the primers point towards the expected second border. Long PCR conditions (Advantage 2, Clontech) are then employed following the manufacturer's directions to amplify the relevant region from genomic DNA isolated from a heterozygote for the lethal mutation. PCR reactions are performed using appropriate pairs of genomic and T-DNA or Ds border primers. Finding a PCR product of the appropriate size, based on the sequence of the TAIL-PCR clone, confirms a valid rescue of the second border. In some cases, the PCR product is directly sequenced to determine the exact insertion point.
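The placement of the three genomic primers can be sketched as simple coordinate arithmetic. The function name, the `missing_side` convention, and the choice of example coordinate are assumptions made for illustration; actual primer positions would be adjusted to suitable priming sequence near each target.

```python
def second_border_primer_sites(insertion_point, missing_side,
                               distances_bp=(1000, 2000, 5000)):
    """Return (position, strand) targets for primers facing the insertion.

    missing_side is +1 when the unrecovered border lies toward higher
    genome-section coordinates and -1 when it lies toward lower ones
    (a hypothetical convention for this sketch).
    """
    # A primer placed at higher coordinates must extend toward lower
    # coordinates to face the insertion, i.e. lie on the "-" strand.
    strand = "-" if missing_side > 0 else "+"
    return [(insertion_point + missing_side * d, strand) for d in distances_bp]


# Illustrative only: 46358 is the single insertion point listed for
# line 83826 in Table 4; the side of the missing border is assumed.
print(second_border_primer_sites(46358, +1))
```

Spacing the primers at increasing distances allows a product to be obtained even if a deletion accompanying the insertion has removed the nearest priming site.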

[0107] If the second border is not recovered with this method, an additional set of PCR reactions is performed. In these experiments, the genomic primers are paired with a series of internal T-DNA or Ds primers designed at about one kb intervals in both orientations across the entire T-DNA or Ds vector sequence. Finding a PCR product of the appropriate size, based on the sequence of the TAIL-PCR clone, confirms a valid rescue of the second border. In some cases, the PCR product is directly sequenced to determine the exact insertion point. Any borders recovered with this approach are classified as abnormal because they lack the ends of the Ds transposon or the expected 24 bp T-DNA imperfect repeat characteristic of right and left borders.
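The tiling of internal primers across the vector, and their pairing with the genomic primers, can be sketched as follows. The vector length of 6500 bp and the primer labels G1 to G3 are hypothetical placeholders; neither is given in the text.

```python
def internal_primer_grid(vector_length_bp, spacing_bp=1000):
    """Return (position, orientation) sites at regular intervals, both strands."""
    grid = []
    for position in range(spacing_bp, vector_length_bp, spacing_bp):
        grid.append((position, "+"))
        grid.append((position, "-"))
    return grid


def pairings(genomic_primers, vector_length_bp, spacing_bp=1000):
    """Pair every genomic primer with every internal vector primer site."""
    grid = internal_primer_grid(vector_length_bp, spacing_bp)
    return [(genomic, site) for genomic in genomic_primers for site in grid]


# Hypothetical 6500 bp vector and three genomic primers.
reactions = pairings(["G1", "G2", "G3"], 6500)
print(len(reactions))
```

Including both orientations at each site covers the case where the vector sequence has been truncated or rearranged so that its orientation relative to the genome is not known in advance.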

Example 7 Identification of Insertion Points for Lines with Lethal Phenotypes

[0108] For each line with a lethal phenotype, the sequences of the borders of the insertion are determined and the insertion points in the Arabidopsis genome are deduced. For Ds insertion lines, PCR products are obtained from the Ds3 and Ds5 borders. For T-DNA lines, PCR products or plasmid rescue clones are obtained from left (LB), right (RB), or abnormal (AB) borders. These sequences are used in BLASTn searches against nucleotide databases (Altschul et al. (1990) J. Mol. Biol. 215:403-410; Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402). The results are summarized in Table 4. Ds line names begin with ET or GT; T-DNA line names are numbers. The insertion point (Insert Pt.) and the direction of the flanking sequence (Dir.), either up (U) or down (D) in the genome section, are noted. Often, small deletions or duplications of genomic DNA accompany the insertion of a T-DNA or Ds transposon.

[0109] The gene that has been inactivated in a given line with a lethal phenotype is determined from the insertion points for that line. Often, the precise location of an ORF for a given gene is not known, but predictions are available in genome sections deposited in GenBank. The precise boundaries of that ORF are determined as described in Example 8.

TABLE 4 Insertion Points For Lines With Lethal Phenotypes

Gene   Line #  Border  Genome Section  Acc. #    Insert Pt.  Dir.
33     62536   LB      T29H11          AL049659  96460       D
33     62536   LB      T29H11          AL049659  96470       U
33     94990   RB      T29H11          AL049659  95809       U
33     94990   LB      T29H11          AL049659  95739       D
417    75602   LB      ATCHRIV71       AL161575  139127      U
417    75602   LB      ATCHRIV71       AL161575  138979      D
510    48882   RB      MQC12           AB024036  60578       D
510    48882   LB      MQC12           AB024036  60632       U
671    98507   LB      T20K9           AC004786  35184       D
671    98507   LB      T20K9           AC004786  35209       U
930    11129   AB      T1E22           AL162874  7988        D
930    11129   LB      T1E22           AL162874  8015        U
931    11206   LB      YUP8H12R        AC002986  12740       U
931    11206   LB      YUP8H12R        AC002986  12716       D
955    11833   LB      MDC12           AB008265  1978        D
955    11833   RB      MDC12           AB008265  2023        U
955    16089   LB      MDC12           AB008265  2141        D
955    16089   RB      MDC12           AB008265  2367        U
955    123345  LB      MDC12           AB008265  2340        D
955    123563  LB      MDC12           AB008265  1936        U
955    123563  LB      MDC12           AB008265  1924        D
962    16696   LB      F7A10           AC027034  15603       D
962    16696   LB      F7A10           AC027034  15663       U
1019   35345   LB      ATCHRIV30       AL161518  60369       D
1019   35345   LB      ATCHRIV30       AL161518  60398       U
1019   41510   LB      ATCHRIV30       AL161518  57633       U
1019   41510   LB      ATCHRIV30       AL161518  57585       D
1159   21281   AB      K9I9            AB013390  32147       D
1159   21281   LB      K9I9            AB013390  32180       U
1380   70615   AB      T8K14           AC007202  36203       U
1380   70615   LB      T8K14           AC007202  34862       D
1413   81281   RB      ATCHRIV12       AL161500  52649       U
1413   81281   RB      ATCHRIV12       AL161500  52462       D
1425   57819   LB      ATCHRIV53       AL161553  27669       D
1425   57819   LB      ATCHRIV53       AL161553  27693       U
1425   96886   LB      ATCHRIV53       AL161553  27705       U
1425   96886   RB      ATCHRIV53       AL161553  27695       D
1456   11627   RB      T24P22          AC084242  7580        U
1456   11627   RB      T24P22          AC084242  7539        D
1457   62024   LB      T8F5            AC004512  62716       U
1457   62024   LB      T8F5            AC004512  62647       D
3209   83826   LB      MDC11           AB024034  46358       D
3537   40773   RB      T13O15          AC010870  45038       D
3537   40773   LB      T13O15          AC010870  45041       U
7726   127024  RB      K24M9           AP001303  26220       U
7726   127024  RB      K24M9           AP001303  26175       D
11197  104603  LB      T20P8           AC005623  19718       D
11197  104603  LB      T20P8           AC005623  19869       U
12258  118669  LB      T14G11          AC002341  39667       U
12258  118669  RB      T14G11          AC002341  39667       D
19814  105512  LB      MJE7            AB020745  39263       D
19814  105512  LB      MJE7            AB020745  39265       U
21858  131461  LB      ATCHRIV80       AL161584  124842      U
21858  131461  LB      ATCHRIV80       AL161584  124806      D
25358  113413  LB      K12B20          AB018107  31416       U
25358  113413  LB      K12B20          AB018107  31389       D
25358  119013  LB      K12B20          AB018107  29254       U
25358  119013  LB      K12B20          AB018107  29124       D
28011  23518   LB      MNB8            AB018116  45722       U
28011  23518   LB      MNB8            AB018116  45703       D
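Rows of Table 4 can be grouped by line number to see how far apart the two recovered insertion points lie, and to flag lines for which only one border is listed. The sketch below uses a seven-row excerpt copied from the table; the function name `border_spacing` is illustrative. The spacing alone does not indicate whether a deletion or a duplication accompanied the insertion, which would require inspecting the flanking sequences and their directions.

```python
from collections import defaultdict

# Excerpt of Table 4: Gene, Line #, Border, Genome Section, Acc. #, Insert Pt., Dir.
TABLE_4_EXCERPT = """\
33 62536 LB T29H11 AL049659 96460 D
33 62536 LB T29H11 AL049659 96470 U
510 48882 RB MQC12 AB024036 60578 D
510 48882 LB MQC12 AB024036 60632 U
12258 118669 LB T14G11 AC002341 39667 U
12258 118669 RB T14G11 AC002341 39667 D
3209 83826 LB MDC11 AB024034 46358 D
"""


def border_spacing(table_text):
    """Map each line number to the distance between its insertion points.

    Lines with a single recorded border map to None.
    """
    points = defaultdict(list)
    for row in table_text.strip().splitlines():
        _gene, line, _border, _section, _accession, insert_pt, _direction = row.split()
        points[line].append(int(insert_pt))
    return {
        line: (max(pts) - min(pts) if len(pts) >= 2 else None)
        for line, pts in points.items()
    }


print(border_spacing(TABLE_4_EXCERPT))
```

A value of None marks a candidate for the second-border experiments described in Example 6.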

Example 8 Identification of cDNAs for Essential Genes

[0110] A cDNA for a gene identified as essential is identified using a variety of approaches. This information enables the ORF for a given gene to be identified and used for other experiments including expression of the corresponding protein in heterologous systems.

[0111] If there is a full-length cDNA deposited in GenBank or published elsewhere, that sequence may be checked independently using methods described below. Alternatively, the sequence may be considered to be correct.

[0112] In some cases, there are published EST sequences that can be assembled to cover the entire ORF from start codon to stop codon. This sequence may be checked independently using methods described below or it may be considered to be correct.

[0113] Often part of the cDNA is published and this information can be used to identify the entire ORF. If the 5′ end containing the start codon is known, 3′ RACE is performed to identify the remainder of the cDNA. If the 3′ end containing the stop codon is known, 5′ RACE is performed to identify the remainder of the cDNA. If both the 5′ and the 3′ ends are known, but the sequence between the two ends of the cDNA is not known, PCR is performed with primers hybridizing to each end of the cDNA. In all three of these cases, PCR is performed using template DNA from a GeneRacer (Invitrogen) or a Marathon (Clontech) cDNA library prepared from RNA isolated from seedling tissue. A resulting PCR product is TA-cloned (Original TA-Cloning kit, Invitrogen) and sequenced.

[0114] If no part of the cDNA is published, the cDNA is identified by starting from gene model predictions in the annotation for genomic clones or elsewhere. To identify the ORF, primers are designed to the 5′ and 3′ ends of the predicted ORF. PCR is performed using template DNA from a cDNA library prepared from seedling tissue or the pFL61 Arabidopsis cDNA library (Minet et al. (1992) Plant J. 2: 417-422). The resulting PCR product is TA-cloned (Original TA-Cloning kit, Invitrogen) and sequenced. Alternatively, 5′ and 3′ RACE are performed with primers predicted by gene models to be in exons. PCR is performed using template DNA from a GeneRacer (Invitrogen) or a Marathon (Clontech) cDNA library prepared from RNA isolated from seedling tissue. A resulting PCR product is TA-cloned (Original TA-Cloning kit, Invitrogen) and sequenced.
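The choice among the approaches in paragraphs [0113] and [0114] depends only on which ends of the cDNA are already known. A minimal sketch of that decision follows; the function name and the returned strings are illustrative labels, not terms from the protocol.

```python
def choose_cdna_strategy(five_prime_known: bool, three_prime_known: bool) -> str:
    """Pick the experiment used to complete the cDNA sequence."""
    if five_prime_known and three_prime_known:
        # Both ends known, interior unknown: amplify between them.
        return "PCR with primers hybridizing to each end of the cDNA"
    if five_prime_known:
        # Start codon end known: extend toward the stop codon.
        return "3' RACE"
    if three_prime_known:
        # Stop codon end known: extend toward the start codon.
        return "5' RACE"
    # Nothing published: fall back on annotation-based predictions.
    return "primers designed from gene model predictions"


print(choose_cdna_strategy(True, False))
```

In every branch the resulting PCR product is TA-cloned and sequenced, so the decision affects only primer design and the template library used.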

[0115] If the cDNA sequence is the same as the sequence predicted in the GenBank annotation, the experiments confirm for the first time the actual ORF. If the cDNA sequence is not the same as the sequence predicted in the GenBank annotation, the experiments identify for the first time the actual ORF. In some cases, more than one cDNA sequence is found for a given gene and both sequences are included in this application.

Example 9 Description of Essential Genes

[0116] The putative function of the protein encoded by each essential gene is determined from analysis of the ORF in each cDNA. Information from the relevant Arabidopsis genomic section deposited in GenBank is used as a starting point to explore the function of a given gene. This analysis also includes BLAST searches (Altschul et al. (1990) J. Mol. Biol. 215:403-410; Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402) of sequence databases to identify similar proteins. Table 5 describes the putative functions for the essential genes discovered in this application.

TABLE 5 Putative Functions For Essential Genes

Gene ID: 33
SEQ ID NO: 1-2
Putative Function & Similar Genes: unknown function, similar genes identified in rice (BAB89963.1), tomato, potato, medicago, sorghum, & barley ESTs; may contain S1 RNA binding domain (PFAM 00575)
References: Bycroft, M. et al. (1997) Cell 88: 235-242

Gene ID: 417
SEQ ID NO: 3-4
Putative Function & Similar Genes: unknown function, contains 2 WD40, G-beta repeat domains (PFAM 00400)
References: Ach, R. A. et al. (1997) Plant Cell 9: 1595-1606

Gene ID: 510
SEQ ID NO: 5-6
Putative Function & Similar Genes: similar to glucan (1,4-alpha-), branching enzymes (glycogen or starch branching enzymes)
References: Preiss, J. & Sivak, M. N. (1998) Genet Eng 20: 177-223; Blauth, S. L. et al. (2001) Plant Physiol 125: 1396-1405; Thon, V. J. et al. (1992) J Biol Chem 267: 15224-15228

Gene ID: 671
SEQ ID NO: 7-8
Putative Function & Similar Genes: may contain a MMR_HSR1, GTPase of unknown function domain (PFAM 01926), similar to B. subtilis & E. coli engB
References: Vernet, C. et al. (1994) Mamm Genome 5: 100-105

Gene ID: 930
SEQ ID NO: 9-10
Putative Function & Similar Genes: similar to E. coli 3′-5′ exoribonuclease, RNase R (aka vacB) & other members of RNase II family
References: Marujo, P. E. et al. (2000) RNA 6: 1185-1193; Mohanty, B. K. & Kushner, S. R. (2000) Mol Microbiol 36: 982-994; Mian, I. S. (1997) Nucleic Acids Res. 25: 3187-3195

Gene ID: 931
SEQ ID NO: 11-12
Putative Function & Similar Genes: similar to D. melanogaster strawberry notch (sno) & human MOP-3; contains a PHD-type zinc finger
References: Majumdar, A. et al. (1997) Genes Dev 11: 1341-1353; Aasland, R. et al. (1995) Trends Biochem Sci 20: 56-59

Gene ID: 955
SEQ ID NO: 13-14
Putative Function & Similar Genes: unknown function, similar to Vigna radiata Bng110 (BAB82451)
References: Kaga, A. & Ishimoto, M. (1998) Mol Gen Genet 258: 378-384

Gene ID: 962
SEQ ID NO: 15-16
Putative Function & Similar Genes: putative n-calpain-1 large subunit; similar to maize DEK1 & mouse calpain 3; calcium activated neutral protease
References: Kidd, V. J. et al. (2000) Semin Cell Dev Biol. 11: 191-201; Donkor, I. O. (2000) Curr Med Chem. 7: 1171-1188; Wang, K. K. (2000) Trends Neurosci. 23: 20-26; Lid, S. E. et al. (2002) Proc Natl Acad Sci USA. 99: 5460-5465

Gene ID: 1019
SEQ ID NO: 17-18
Putative Function & Similar Genes: similar to human AdoMet-binding subunit of (N6-adenosine)-methyltransferase (AAG13956) & S. cerevisiae IME4 (aka SPO8)
References: Bokar, J. A. et al. (1997) RNA 3: 1233-1247; Shah, J. C. & Clancy M. J. (1992) Mol Cell Biol 12: 1078-1086; Finnegan, E. J. & Kovac, K. A. (2000) Plant Mol Biol. 43: 189-201

Gene ID: 1159
SEQ ID NO: 19-20
Putative Function & Similar Genes: unknown function, similar to petunia Rf-PPR592 fertility restorer protein, contains PPR domains (PFAM 01535), member of large gene family in Arabidopsis
References: Bentolila, S. et al. (2002) Proc Natl Acad Sci USA 99: 10887-10892; Small, I. D. & Peeters, N. (2000) Trends Biochem Sci 25: 46-47; Fisk, D. G. et al. (1999) EMBO J 18: 2621-2630; Coffin, J. W. (1997) Curr. Genet. 32: 273-280

Gene ID: 1380
SEQ ID NO: 21-22
Putative Function & Similar Genes: unknown function, similar to petunia Rf-PPR592 fertility restorer protein, contains a SMR (Small MutS-related) domain and PPR repeats (PFAM 01535), member of large gene family in Arabidopsis
References: Bentolila, S. et al. (2002) Proc Natl Acad Sci USA 99: 10887-10892; Small, I. D. & Peeters, N. (2000) Trends Biochem Sci 25: 46-47; Fisk, D. G. et al. (1999) EMBO J 18: 2621-2630; Coffin, J. W. (1997) Curr. Genet. 32: 273-280; Moreira, D. & Philippe, H. (1999) Trends Biochem Sci 24: 298-300

Gene ID: 1413
SEQ ID NO: 23-24
Putative Function & Similar Genes: putative leucyl tRNA synthetase
References: Thompson, L. H. et al. (1973) Proc Natl Acad Sci USA 70: 3094-3098; Hartlein, M. & Madern, D. (1987) Nucleic Acids Res 15: 10199-10210; Labouesse, M. (1990) Mol Gen Genet 224: 209-221

Gene ID: 1425
SEQ ID NO: 25-26
Putative Function & Similar Genes: similar to RNA splicing factors small nuclear ribonucleoprotein B, B′, & N in humans and other animals
References: Gray, T. A. et al. (1999) Nucleic Acids Res 27: 4577-4584; Ozcelik, T. et al. (1992) Nat. Genet. 2: 265-269

Gene ID: 1456
SEQ ID NO: 27-28
Putative Function & Similar Genes: putative chorismate synthase
References: Schaller, A. et al. (1991) J Biol Chem. 266: 21434-21438; Gorlach, J. et al. (1993) Plant Mol Biol. 23: 707-716; Gorlach, J. et al. (1995) Plant J. 8: 451-456; Braun, M. et al. (1996) Planta 200: 64-70

Gene ID: 1457
SEQ ID NO: 29-30
Putative Function & Similar Genes: putative ABC transporter
References: Stacey, G. et al. (2002) Trends Plant Sci. 7: 257-263; Smith, P. et al. (2002) Mol Cell 10: 139-149; Fath, M. J. & Kolter, R. (1993) Microbiol Rev. 57: 995-1017

Gene ID: 3209
SEQ ID NO: 31-32
Putative Function & Similar Genes: unknown function, may contain a WD40, G-beta repeat domain (PFAM 00400), weak similarity to human autoantigen (RCD-8)
References: Ach, R. A. et al. (1997) Plant Cell 9: 1595-1606; Garcia-Lozano, J. R. et al. (1997) Clin Exp Immunol 107: 501-506

Gene ID: 3537
SEQ ID NO: 33-34
Putative Function & Similar Genes: similar to maize CRS1, required for splicing atpF group II intron in chloroplasts, which has a seedling lethal phenotype
References: Till, B. et al. (2001) RNA 7: 1227-1238; Vogel, J., Borner, T. et al. (1999) Nucleic Acids Res. 27: 3866-3874

Gene ID: 7726
SEQ ID NO: 35-36
Putative Function & Similar Genes: unknown function, contains single-stranded binding protein domain (PFAM 00436), weak similarity to E. coli SSB2
References: Meyer, R. R. & Laine, P. S. (1990) Microbiol Rev 54: 342-380; Meyer, R. R. et al. (1979) Proc Natl Acad Sci USA. 76: 1702-1705; Ruvolo, P. P. et al. (1991) Proteins 9: 120-134

Gene ID: 11197
SEQ ID NO: 37-38
Putative Function & Similar Genes: Arabidopsis COP9 complex subunit CSN2, contains a PCI/PINT domain (PFAM 01399)
References: Fu, H. et al. (2001) EMBO J. 20: 7096-7107; Bech-Otschir, D. et al. (2002) J. Cell Sci. 115: 467-473; Chamovitz, D. & Glickman, M. (2002) Curr Biol. 12: R232

Gene ID: 12258
SEQ ID NO: 39-40
Putative Function & Similar Genes: unknown function, contains a DUF231 domain (PFAM 03005), member of large gene family in Arabidopsis
References: none

Gene ID: 19814
SEQ ID NO: 41-42
Putative Function & Similar Genes: unknown function
References: none

Gene ID: 21858
SEQ ID NO: 43-44
Putative Function & Similar Genes: unknown function, contains PPR domains (PFAM 01535), member of large gene family in Arabidopsis
References: Small, I. D. & Peeters, N. (2000) Trends Biochem Sci 25: 46-47; Fisk, D. G. et al. (1999) EMBO J 18: 2621-2630; Coffin, J. W. (1997) Curr. Genet. 32: 273-280

Gene ID: 25358
SEQ ID NO: 45-46
Putative Function & Similar Genes: similar to Xenopus chromosome condensation protein XCAP-G, 130 kD subunit of the 13S condensin complex, & human hCAP-G
References: Cabello, O. A. et al. (2001) Mol Biol Cell 12: 3527-3537; Kimura, K. et al. (2001) J Biol Chem 276: 5417-5420

Gene ID: 28011
SEQ ID NO: 47-48
Putative Function & Similar Genes: unknown function, contains tetratricopeptide repeat (TPR) domains (PFAM 00515)
References: Lamb, J. R. et al. (1995) Trends Biochem Sci 20: 257-259; Das, A. K. et al. (1998) EMBO J 17: 1192-1199; Goebl, M. & Yanagida, M. (1991) Trends Biochem Sci 16: 173-177

Example 10 Expression of Recombinant Essential Proteins in E. coli

[0117] The coding region of each of the essential proteins, corresponding to cDNA clones of odd-numbered SEQ ID NO: 1-96, is subcloned into an appropriate expression vector, and transformed into E. coli using the manufacturer's conditions. Specific examples include plasmids such as pBluescript (Stratagene, La Jolla, Calif.), pFLAG (International Biotechnologies, Inc., New Haven, Conn.), and pTrcHis (Invitrogen, La Jolla, Calif.). E. coli is cultured, and expression of the essential protein is confirmed. Recombinant protein is isolated using standard techniques.
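Before subcloning, a coding region can be checked against its paired protein sequence by conceptual translation. The sketch below assumes the standard genetic code and uses one-letter amino acid codes, whereas the sequence listing uses three-letter codes; the function name `translate` is illustrative. The two test fragments are the first eight codons and the last five codons of SEQ ID NO: 1 as given in the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    sequence = cds.lower().replace(" ", "")
    protein = []
    for start in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[start:start + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of SEQ ID NO: 1 (Met Gln Ile Cys Gln Thr Lys Leu).
print(translate("atg cag att tgc caa acc aag ctc"))
# Last five codons of SEQ ID NO: 1, ending in the tga stop codon.
print(translate("gtc ttg aca gac tga"))
```

A mismatch between the translated ORF and the expected protein at this stage would point to a cloning or sequencing error before any expression work is done.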

Example 11 In Vitro Binding Assays

[0118] Recombinant protein for each of the essential genes described in this application is obtained, for example, according to Example 10. The protein is immobilized on chips appropriate for ligand binding assays using techniques that are well known in the art. The protein immobilized on the chip is exposed to sample compound in solution according to methods well known in the art. While the sample compound is in contact with the immobilized protein, measurements capable of detecting protein-ligand interactions are conducted. Examples of such measurements are SELDI, Biacore, and FCS, described above. Compounds found to bind the protein are readily discovered in this fashion and are subjected to further characterization.

[0119] The above-disclosed embodiments are illustrative. This disclosure of the invention will place one skilled in the art in possession of many variations of the invention. All such obvious and foreseeable variations are intended to be encompassed by the present invention.

1 73 1 2007 DNA Arabidopsis thaliana CDS (1)..(2007) 1 atg cag att tgc caa acc aag ctc aat ttc act ttc cct aat ccc aca 48 Met Gln Ile Cys Gln Thr Lys Leu Asn Phe Thr Phe Pro Asn Pro Thr 1 5 10 15 aac cct aat ttc tgc aaa ccc aaa gct ctt caa tgg tca ccg cct cgt 96 Asn Pro Asn Phe Cys Lys Pro Lys Ala Leu Gln Trp Ser Pro Pro Arg 20 25 30 cgc ata tcc ttg ctg cct tgt cgt gga ttc agc tcc gat gaa ttc cca 144 Arg Ile Ser Leu Leu Pro Cys Arg Gly Phe Ser Ser Asp Glu Phe Pro 35 40 45 gtc gac gaa acc ttc ctc gag aaa ttc gga cca aag gac aaa gac aca 192 Val Asp Glu Thr Phe Leu Glu Lys Phe Gly Pro Lys Asp Lys Asp Thr 50 55 60 gaa gat gaa gct cga cga cgt aac tgg atc gaa cgt ggt tgg gct cca 240 Glu Asp Glu Ala Arg Arg Arg Asn Trp Ile Glu Arg Gly Trp Ala Pro 65 70 75 80 tgg gaa gag att ctc aca cca gaa gct gat ttc gct cgt aaa tct ctc 288 Trp Glu Glu Ile Leu Thr Pro Glu Ala Asp Phe Ala Arg Lys Ser Leu 85 90 95 aac gaa ggt gaa gaa gtt ccg ctt caa tcg ccg gaa gcg atc gaa gcg 336 Asn Glu Gly Glu Glu Val Pro Leu Gln Ser Pro Glu Ala Ile Glu Ala 100 105 110 ttt aag atg ctg aga cca tcg tat agg aag aag aag att aag gag atg 384 Phe Lys Met Leu Arg Pro Ser Tyr Arg Lys Lys Lys Ile Lys Glu Met 115 120 125 ggg ata aca gaa gac gaa tgg tat gca aag caa ttt gag att aga ggt 432 Gly Ile Thr Glu Asp Glu Trp Tyr Ala Lys Gln Phe Glu Ile Arg Gly 130 135 140 gat aaa cca cct cct tta gaa aca tct tgg gct ggt ccg atg gtt ctt 480 Asp Lys Pro Pro Pro Leu Glu Thr Ser Trp Ala Gly Pro Met Val Leu 145 150 155 160 agg caa att ccg ccg cgt gat tgg cct ccc aga ggt tgg gaa gtt gat 528 Arg Gln Ile Pro Pro Arg Asp Trp Pro Pro Arg Gly Trp Glu Val Asp 165 170 175 agg aag gag ctg gag ttt att agg gaa gct cat aag tta atg gct gaa 576 Arg Lys Glu Leu Glu Phe Ile Arg Glu Ala His Lys Leu Met Ala Glu 180 185 190 aga gtt tgg ctt gag gat ttg gat aag gat ttg aga gtt ggt gaa gat 624 Arg Val Trp Leu Glu Asp Leu Asp Lys Asp Leu Arg Val Gly Glu Asp 195 200 205 gct act gtt gat aag atg tgt ttg gag agg ttt aag gtt ttc ttg aaa 672 Ala Thr Val Asp 
Lys Met Cys Leu Glu Arg Phe Lys Val Phe Leu Lys 210 215 220 caa tac aag gaa tgg gtt gaa gat aat aaa gat agg ttg gag gaa gaa 720 Gln Tyr Lys Glu Trp Val Glu Asp Asn Lys Asp Arg Leu Glu Glu Glu 225 230 235 240 tct tac aag ctc gat cag gat ttt tat ccg ggt agg agg aaa aga ggg 768 Ser Tyr Lys Leu Asp Gln Asp Phe Tyr Pro Gly Arg Arg Lys Arg Gly 245 250 255 aag gat tac gaa gat ggg atg tat gag ctt ccc ttt tac tat cca ggg 816 Lys Asp Tyr Glu Asp Gly Met Tyr Glu Leu Pro Phe Tyr Tyr Pro Gly 260 265 270 atg gtt tgt gaa ggc aca gtt acc act tta cat ctg tat cag gga gcg 864 Met Val Cys Glu Gly Thr Val Thr Thr Leu His Leu Tyr Gln Gly Ala 275 280 285 ttt gtt gac att gga ggt gtt cat gaa gga tgg gta cct ata aaa ggt 912 Phe Val Asp Ile Gly Gly Val His Glu Gly Trp Val Pro Ile Lys Gly 290 295 300 aat gac tgg ttt tgg atc cgg cat ttc ata aaa gtt ggg atg cat gtt 960 Asn Asp Trp Phe Trp Ile Arg His Phe Ile Lys Val Gly Met His Val 305 310 315 320 atc gtt gaa atc acg gca aaa aga gat cca tac cgg ttt cgg ttt ccc 1008 Ile Val Glu Ile Thr Ala Lys Arg Asp Pro Tyr Arg Phe Arg Phe Pro 325 330 335 ttg gag ttg cgc ttc gtc cat cct aac ata gat cac atg ata ttt aat 1056 Leu Glu Leu Arg Phe Val His Pro Asn Ile Asp His Met Ile Phe Asn 340 345 350 aaa ttt gac ttc cca cca ata ttc cat cgt gat ggg gat act aat cca 1104 Lys Phe Asp Phe Pro Pro Ile Phe His Arg Asp Gly Asp Thr Asn Pro 355 360 365 gat gag ata cgg cga gat tgt gga aga cct cct gaa cct aga aaa gat 1152 Asp Glu Ile Arg Arg Asp Cys Gly Arg Pro Pro Glu Pro Arg Lys Asp 370 375 380 cca gga tca aag cca gag gag gaa ggg ctg ctc tct gat cac cct tat 1200 Pro Gly Ser Lys Pro Glu Glu Glu Gly Leu Leu Ser Asp His Pro Tyr 385 390 395 400 gtc gac aag ttg tgg cag ata cat gta gct gag caa atg att ttg ggt 1248 Val Asp Lys Leu Trp Gln Ile His Val Ala Glu Gln Met Ile Leu Gly 405 410 415 gat tac gaa gct aac cct gca aaa tac gaa ggc aaa aag cta tca gaa 1296 Asp Tyr Glu Ala Asn Pro Ala Lys Tyr Glu Gly Lys Lys Leu Ser Glu 420 425 430 tta tct gat gat gaa gac ttt gat gaa caa aag gat 
atc gag tat ggc 1344 Leu Ser Asp Asp Glu Asp Phe Asp Glu Gln Lys Asp Ile Glu Tyr Gly 435 440 445 gaa gct tat tat aag aaa acc aaa ttg cca aaa gtg att ctg aaa acc 1392 Glu Ala Tyr Tyr Lys Lys Thr Lys Leu Pro Lys Val Ile Leu Lys Thr 450 455 460 agt gtc aag gaa ctt gac tta gag gct gca ttg acc gag cgc cag cac 1440 Ser Val Lys Glu Leu Asp Leu Glu Ala Ala Leu Thr Glu Arg Gln His 465 470 475 480 cac aat aaa cta atg atg gaa gct aaa gca aga gga gaa gga tac aaa 1488 His Asn Lys Leu Met Met Glu Ala Lys Ala Arg Gly Glu Gly Tyr Lys 485 490 495 att gac aag ctc aga cga aat ata gag atg gac gag tat gat ttc tta 1536 Ile Asp Lys Leu Arg Arg Asn Ile Glu Met Asp Glu Tyr Asp Phe Leu 500 505 510 cac tgg cgc cga tct ttg gag gaa aga gaa gca ttg ctc aga gat atc 1584 His Trp Arg Arg Ser Leu Glu Glu Arg Glu Ala Leu Leu Arg Asp Ile 515 520 525 agc tct cgg caa gca ctt ggt tta cca ttg gag gaa cca ggg agg tac 1632 Ser Ser Arg Gln Ala Leu Gly Leu Pro Leu Glu Glu Pro Gly Arg Tyr 530 535 540 aag cca gga agc ttc ttt ggg aaa gac cag tac gat cca aca agt gca 1680 Lys Pro Gly Ser Phe Phe Gly Lys Asp Gln Tyr Asp Pro Thr Ser Ala 545 550 555 560 tta tat cag tat gac tac tgg gga gag cca aag aac tca gag att agc 1728 Leu Tyr Gln Tyr Asp Tyr Trp Gly Glu Pro Lys Asn Ser Glu Ile Ser 565 570 575 aag caa gag aga atg aag gat gca cac aac aaa tcc att gtt ggg aaa 1776 Lys Gln Glu Arg Met Lys Asp Ala His Asn Lys Ser Ile Val Gly Lys 580 585 590 ggc aat gtg tgg tat gac atg tct tac gat gat gcg att aag cag aca 1824 Gly Asn Val Trp Tyr Asp Met Ser Tyr Asp Asp Ala Ile Lys Gln Thr 595 600 605 ata gag aaa aga aaa gaa ggg tct acc ttg gcg agt caa gaa gaa gaa 1872 Ile Glu Lys Arg Lys Glu Gly Ser Thr Leu Ala Ser Gln Glu Glu Glu 610 615 620 aca gag tca gag gaa gaa gaa gag gat gac gat gat ttt gac gat ttt 1920 Thr Glu Ser Glu Glu Glu Glu Glu Asp Asp Asp Asp Phe Asp Asp Phe 625 630 635 640 gac tat agc att ctg agt gat gag agc agt atc ggt tac tcg gaa cag 1968 Asp Tyr Ser Ile Leu Ser Asp Glu Ser Ser Ile Gly Tyr Ser Glu Gln 645 650 655 caa 
cct ctt gtt aac ggt act caa gtc ttg aca gac tga 2007 Gln Pro Leu Val Asn Gly Thr Gln Val Leu Thr Asp 660 665 2 668 PRT Arabidopsis thaliana 2 Met Gln Ile Cys Gln Thr Lys Leu Asn Phe Thr Phe Pro Asn Pro Thr 1 5 10 15 Asn Pro Asn Phe Cys Lys Pro Lys Ala Leu Gln Trp Ser Pro Pro Arg 20 25 30 Arg Ile Ser Leu Leu Pro Cys Arg Gly Phe Ser Ser Asp Glu Phe Pro 35 40 45 Val Asp Glu Thr Phe Leu Glu Lys Phe Gly Pro Lys Asp Lys Asp Thr 50 55 60 Glu Asp Glu Ala Arg Arg Arg Asn Trp Ile Glu Arg Gly Trp Ala Pro 65 70 75 80 Trp Glu Glu Ile Leu Thr Pro Glu Ala Asp Phe Ala Arg Lys Ser Leu 85 90 95 Asn Glu Gly Glu Glu Val Pro Leu Gln Ser Pro Glu Ala Ile Glu Ala 100 105 110 Phe Lys Met Leu Arg Pro Ser Tyr Arg Lys Lys Lys Ile Lys Glu Met 115 120 125 Gly Ile Thr Glu Asp Glu Trp Tyr Ala Lys Gln Phe Glu Ile Arg Gly 130 135 140 Asp Lys Pro Pro Pro Leu Glu Thr Ser Trp Ala Gly Pro Met Val Leu 145 150 155 160 Arg Gln Ile Pro Pro Arg Asp Trp Pro Pro Arg Gly Trp Glu Val Asp 165 170 175 Arg Lys Glu Leu Glu Phe Ile Arg Glu Ala His Lys Leu Met Ala Glu 180 185 190 Arg Val Trp Leu Glu Asp Leu Asp Lys Asp Leu Arg Val Gly Glu Asp 195 200 205 Ala Thr Val Asp Lys Met Cys Leu Glu Arg Phe Lys Val Phe Leu Lys 210 215 220 Gln Tyr Lys Glu Trp Val Glu Asp Asn Lys Asp Arg Leu Glu Glu Glu 225 230 235 240 Ser Tyr Lys Leu Asp Gln Asp Phe Tyr Pro Gly Arg Arg Lys Arg Gly 245 250 255 Lys Asp Tyr Glu Asp Gly Met Tyr Glu Leu Pro Phe Tyr Tyr Pro Gly 260 265 270 Met Val Cys Glu Gly Thr Val Thr Thr Leu His Leu Tyr Gln Gly Ala 275 280 285 Phe Val Asp Ile Gly Gly Val His Glu Gly Trp Val Pro Ile Lys Gly 290 295 300 Asn Asp Trp Phe Trp Ile Arg His Phe Ile Lys Val Gly Met His Val 305 310 315 320 Ile Val Glu Ile Thr Ala Lys Arg Asp Pro Tyr Arg Phe Arg Phe Pro 325 330 335 Leu Glu Leu Arg Phe Val His Pro Asn Ile Asp His Met Ile Phe Asn 340 345 350 Lys Phe Asp Phe Pro Pro Ile Phe His Arg Asp Gly Asp Thr Asn Pro 355 360 365 Asp Glu Ile Arg Arg Asp Cys Gly Arg Pro Pro Glu Pro Arg Lys Asp 370 375 380 Pro Gly Ser Lys Pro Glu Glu Glu Gly 
Leu Leu Ser Asp His Pro Tyr 385 390 395 400 Val Asp Lys Leu Trp Gln Ile His Val Ala Glu Gln Met Ile Leu Gly 405 410 415 Asp Tyr Glu Ala Asn Pro Ala Lys Tyr Glu Gly Lys Lys Leu Ser Glu 420 425 430 Leu Ser Asp Asp Glu Asp Phe Asp Glu Gln Lys Asp Ile Glu Tyr Gly 435 440 445 Glu Ala Tyr Tyr Lys Lys Thr Lys Leu Pro Lys Val Ile Leu Lys Thr 450 455 460 Ser Val Lys Glu Leu Asp Leu Glu Ala Ala Leu Thr Glu Arg Gln His 465 470 475 480 His Asn Lys Leu Met Met Glu Ala Lys Ala Arg Gly Glu Gly Tyr Lys 485 490 495 Ile Asp Lys Leu Arg Arg Asn Ile Glu Met Asp Glu Tyr Asp Phe Leu 500 505 510 His Trp Arg Arg Ser Leu Glu Glu Arg Glu Ala Leu Leu Arg Asp Ile 515 520 525 Ser Ser Arg Gln Ala Leu Gly Leu Pro Leu Glu Glu Pro Gly Arg Tyr 530 535 540 Lys Pro Gly Ser Phe Phe Gly Lys Asp Gln Tyr Asp Pro Thr Ser Ala 545 550 555 560 Leu Tyr Gln Tyr Asp Tyr Trp Gly Glu Pro Lys Asn Ser Glu Ile Ser 565 570 575 Lys Gln Glu Arg Met Lys Asp Ala His Asn Lys Ser Ile Val Gly Lys 580 585 590 Gly Asn Val Trp Tyr Asp Met Ser Tyr Asp Asp Ala Ile Lys Gln Thr 595 600 605 Ile Glu Lys Arg Lys Glu Gly Ser Thr Leu Ala Ser Gln Glu Glu Glu 610 615 620 Thr Glu Ser Glu Glu Glu Glu Glu Asp Asp Asp Asp Phe Asp Asp Phe 625 630 635 640 Asp Tyr Ser Ile Leu Ser Asp Glu Ser Ser Ile Gly Tyr Ser Glu Gln 645 650 655 Gln Pro Leu Val Asn Gly Thr Gln Val Leu Thr Asp 660 665 3 1161 DNA Arabidopsis thaliana CDS (1)..(1161) 3 atg agt aag aga cct ccg cct gat cca gtg gct gtg ctt aga ggg cac 48 Met Ser Lys Arg Pro Pro Pro Asp Pro Val Ala Val Leu Arg Gly His 1 5 10 15 cgt cac tct gtt atg gat gtt tcc ttc cat cca tca aaa tct ctg ctt 96 Arg His Ser Val Met Asp Val Ser Phe His Pro Ser Lys Ser Leu Leu 20 25 30 ttc act ggt tca gct gat gga gag ttg agg att tgg gat act att cag 144 Phe Thr Gly Ser Ala Asp Gly Glu Leu Arg Ile Trp Asp Thr Ile Gln 35 40 45 cac cgt gca gtt tct tct gct tgg gct cat agt aga gca aat ggg gtt 192 His Arg Ala Val Ser Ser Ala Trp Ala His Ser Arg Ala Asn Gly Val 50 55 60 tta gct gta gca gct agt ccc tgg tta gga gag gac aag att 
atc agc 240 Leu Ala Val Ala Ala Ser Pro Trp Leu Gly Glu Asp Lys Ile Ile Ser 65 70 75 80 cag gga aga gat gga act gtt aag tgt tgg gat att gaa gat ggt ggt 288 Gln Gly Arg Asp Gly Thr Val Lys Cys Trp Asp Ile Glu Asp Gly Gly 85 90 95 cta tca aga gac cca tta cta ata ctt gag aca tgc gcc tat cat ttt 336 Leu Ser Arg Asp Pro Leu Leu Ile Leu Glu Thr Cys Ala Tyr His Phe 100 105 110 tgc aag ttc tct ctt gtt aag aag cct aag aat tct ctg caa gaa gct 384 Cys Lys Phe Ser Leu Val Lys Lys Pro Lys Asn Ser Leu Gln Glu Ala 115 120 125 gaa agt cac tca cgg ggt tgt gat gag cag gat ggt gga gac aca tgt 432 Glu Ser His Ser Arg Gly Cys Asp Glu Gln Asp Gly Gly Asp Thr Cys 130 135 140 aat gtt caa att gct gat gat agt gaa cga tct gag gag gat tct ggt 480 Asn Val Gln Ile Ala Asp Asp Ser Glu Arg Ser Glu Glu Asp Ser Gly 145 150 155 160 ttg ctt cag gat aaa gat cat gct gaa ggt acc act ttt gta gca gta 528 Leu Leu Gln Asp Lys Asp His Ala Glu Gly Thr Thr Phe Val Ala Val 165 170 175 gta ggg gag caa cct act gag gta gag ata tgg gat ctc aat acc gga 576 Val Gly Glu Gln Pro Thr Glu Val Glu Ile Trp Asp Leu Asn Thr Gly 180 185 190 gac aag ata ata cag ctt cct caa agt agc cct gac gaa tcc ccc aat 624 Asp Lys Ile Ile Gln Leu Pro Gln Ser Ser Pro Asp Glu Ser Pro Asn 195 200 205 gct tct acc aaa gga aga gga atg tgt atg gct gtt caa ttg ttt tgc 672 Ala Ser Thr Lys Gly Arg Gly Met Cys Met Ala Val Gln Leu Phe Cys 210 215 220 cct cct gaa tca cag ggt ttc tta cat gtc tta gct ggt tat gag gat 720 Pro Pro Glu Ser Gln Gly Phe Leu His Val Leu Ala Gly Tyr Glu Asp 225 230 235 240 ggt tca ata ctt ctt tgg gat ata cgc aat gct aag att cct cta aca 768 Gly Ser Ile Leu Leu Trp Asp Ile Arg Asn Ala Lys Ile Pro Leu Thr 245 250 255 agt gtg aag ttt cat tca gag cca gtt tta agt ctt agt gtt gca tca 816 Ser Val Lys Phe His Ser Glu Pro Val Leu Ser Leu Ser Val Ala Ser 260 265 270 tca tgt gat gga gga atc tca gga ggg gca gat gac aaa att gtg atg 864 Ser Cys Asp Gly Gly Ile Ser Gly Gly Ala Asp Asp Lys Ile Val Met 275 280 285 tat aat ctc aac cat tca act ggt 
tct tgt aca ata agg aaa gag att 912 Tyr Asn Leu Asn His Ser Thr Gly Ser Cys Thr Ile Arg Lys Glu Ile 290 295 300 act ttg gaa agg ccc ggt gta tcg ggc acc tca att cga gtc gat ggg 960 Thr Leu Glu Arg Pro Gly Val Ser Gly Thr Ser Ile Arg Val Asp Gly 305 310 315 320 aaa att gca gcc acg gct ggt tgg gac cac aga ata cga gtc tat aat 1008 Lys Ile Ala Ala Thr Ala Gly Trp Asp His Arg Ile Arg Val Tyr Asn 325 330 335 tac cgc aaa gga aat gct cta gca ata cta aag tac cac cga gca acg 1056 Tyr Arg Lys Gly Asn Ala Leu Ala Ile Leu Lys Tyr His Arg Ala Thr 340 345 350 tgc aac gct gtg tcc tac tct cca gac tgt gag cta atg gcg tct gcg 1104 Cys Asn Ala Val Ser Tyr Ser Pro Asp Cys Glu Leu Met Ala Ser Ala 355 360 365 tct gaa gat gca act gtt gct ctc tgg aaa cta tat cct ccc cat aaa 1152 Ser Glu Asp Ala Thr Val Ala Leu Trp Lys Leu Tyr Pro Pro His Lys 370 375 380 tct ctc tga 1161 Ser Leu 385 4 386 PRT Arabidopsis thaliana 4 Met Ser Lys Arg Pro Pro Pro Asp Pro Val Ala Val Leu Arg Gly His 1 5 10 15 Arg His Ser Val Met Asp Val Ser Phe His Pro Ser Lys Ser Leu Leu 20 25 30 Phe Thr Gly Ser Ala Asp Gly Glu Leu Arg Ile Trp Asp Thr Ile Gln 35 40 45 His Arg Ala Val Ser Ser Ala Trp Ala His Ser Arg Ala Asn Gly Val 50 55 60 Leu Ala Val Ala Ala Ser Pro Trp Leu Gly Glu Asp Lys Ile Ile Ser 65 70 75 80 Gln Gly Arg Asp Gly Thr Val Lys Cys Trp Asp Ile Glu Asp Gly Gly 85 90 95 Leu Ser Arg Asp Pro Leu Leu Ile Leu Glu Thr Cys Ala Tyr His Phe 100 105 110 Cys Lys Phe Ser Leu Val Lys Lys Pro Lys Asn Ser Leu Gln Glu Ala 115 120 125 Glu Ser His Ser Arg Gly Cys Asp Glu Gln Asp Gly Gly Asp Thr Cys 130 135 140 Asn Val Gln Ile Ala Asp Asp Ser Glu Arg Ser Glu Glu Asp Ser Gly 145 150 155 160 Leu Leu Gln Asp Lys Asp His Ala Glu Gly Thr Thr Phe Val Ala Val 165 170 175 Val Gly Glu Gln Pro Thr Glu Val Glu Ile Trp Asp Leu Asn Thr Gly 180 185 190 Asp Lys Ile Ile Gln Leu Pro Gln Ser Ser Pro Asp Glu Ser Pro Asn 195 200 205 Ala Ser Thr Lys Gly Arg Gly Met Cys Met Ala Val Gln Leu Phe Cys 210 215 220 Pro Pro Glu Ser Gln Gly Phe Leu His Val Leu 
Ala Gly Tyr Glu Asp 225 230 235 240 Gly Ser Ile Leu Leu Trp Asp Ile Arg Asn Ala Lys Ile Pro Leu Thr 245 250 255 Ser Val Lys Phe His Ser Glu Pro Val Leu Ser Leu Ser Val Ala Ser 260 265 270 Ser Cys Asp Gly Gly Ile Ser Gly Gly Ala Asp Asp Lys Ile Val Met 275 280 285 Tyr Asn Leu Asn His Ser Thr Gly Ser Cys Thr Ile Arg Lys Glu Ile 290 295 300 Thr Leu Glu Arg Pro Gly Val Ser Gly Thr Ser Ile Arg Val Asp Gly 305 310 315 320 Lys Ile Ala Ala Thr Ala Gly Trp Asp His Arg Ile Arg Val Tyr Asn 325 330 335 Tyr Arg Lys Gly Asn Ala Leu Ala Ile Leu Lys Tyr His Arg Ala Thr 340 345 350 Cys Asn Ala Val Ser Tyr Ser Pro Asp Cys Glu Leu Met Ala Ser Ala 355 360 365 Ser Glu Asp Ala Thr Val Ala Leu Trp Lys Leu Tyr Pro Pro His Lys 370 375 380 Ser Leu 385 5 2700 DNA Arabidopsis thaliana CDS (1)..(2700) 5 atg gtg tcc ctc tct aat caa acc aga ttc tct ttc cac cca aac aat 48 Met Val Ser Leu Ser Asn Gln Thr Arg Phe Ser Phe His Pro Asn Asn 1 5 10 15 ctt gtc gtt tcc gag aaa cgg cgc ctg gga atc tca gga gtc aat ttt 96 Leu Val Val Ser Glu Lys Arg Arg Leu Gly Ile Ser Gly Val Asn Phe 20 25 30 ccc agg aag atc aag ttg aaa atc act tgc ttc gcc gcc gag aga cca 144 Pro Arg Lys Ile Lys Leu Lys Ile Thr Cys Phe Ala Ala Glu Arg Pro 35 40 45 cgc cag gag aaa cag aaa aag aag agt cag agc cag agc act agc gac 192 Arg Gln Glu Lys Gln Lys Lys Lys Ser Gln Ser Gln Ser Thr Ser Asp 50 55 60 gct gag gct gga gtt gac cct gtt ggg ttt ctc acc aga ctc gga att 240 Ala Glu Ala Gly Val Asp Pro Val Gly Phe Leu Thr Arg Leu Gly Ile 65 70 75 80 gct gac aga atc ttt gct caa ttt ctc cgg gaa agg cat aaa gct ttg 288 Ala Asp Arg Ile Phe Ala Gln Phe Leu Arg Glu Arg His Lys Ala Leu 85 90 95 aaa gac ctt aaa gat gag att ttc aaa cgc cat ttc gat ttc aga gat 336 Lys Asp Leu Lys Asp Glu Ile Phe Lys Arg His Phe Asp Phe Arg Asp 100 105 110 ttt gct tca gga ttt gag tta ttg ggg atg cat cga cat atg gaa cac 384 Phe Ala Ser Gly Phe Glu Leu Leu Gly Met His Arg His Met Glu His 115 120 125 cgg gtt gac ttt atg gac tgg ggt cca ggt tct cgt tat ggg gct ata 432 Arg Val 
Asp Phe Met Asp Trp Gly Pro Gly Ser Arg Tyr Gly Ala Ile 130 135 140 atc gga gat ttc aat gga tgg tcg cct aca gaa aat gca gct aga gag 480 Ile Gly Asp Phe Asn Gly Trp Ser Pro Thr Glu Asn Ala Ala Arg Glu 145 150 155 160 ggc cta ttt ggc cat gat gat tat ggg tat tgg ttc att att ctt gaa 528 Gly Leu Phe Gly His Asp Asp Tyr Gly Tyr Trp Phe Ile Ile Leu Glu 165 170 175 gat aag tta aga gag ggt gaa gaa cca gat gaa cta tac ttt cag cag 576 Asp Lys Leu Arg Glu Gly Glu Glu Pro Asp Glu Leu Tyr Phe Gln Gln 180 185 190 tat aac tat gtc gat gac tat gat aag ggt gat agt ggt gta tca gct 624 Tyr Asn Tyr Val Asp Asp Tyr Asp Lys Gly Asp Ser Gly Val Ser Ala 195 200 205 gaa gaa atc ttc cag aaa gct aat gat gag tat tgg gaa cct ggt gag 672 Glu Glu Ile Phe Gln Lys Ala Asn Asp Glu Tyr Trp Glu Pro Gly Glu 210 215 220 gat cgg ttt att aaa aac cgc ttt gaa gtt cct gcc aaa cta tac gag 720 Asp Arg Phe Ile Lys Asn Arg Phe Glu Val Pro Ala Lys Leu Tyr Glu 225 230 235 240 cag atg ttt ggc ccg aac agt cca caa acg tta gaa gag ctt gga gat 768 Gln Met Phe Gly Pro Asn Ser Pro Gln Thr Leu Glu Glu Leu Gly Asp 245 250 255 ata cca gat gca gaa acg aga tat aaa cag tgg aaa gag gag cat aaa 816 Ile Pro Asp Ala Glu Thr Arg Tyr Lys Gln Trp Lys Glu Glu His Lys 260 265 270 gat gac ccg cca agt aat cta cct cca tgc gac atc att gat aaa ggt 864 Asp Asp Pro Pro Ser Asn Leu Pro Pro Cys Asp Ile Ile Asp Lys Gly 275 280 285 caa gga aag cca tat gat att ttt aat gtc gtg aca tct cca gaa tgg 912 Gln Gly Lys Pro Tyr Asp Ile Phe Asn Val Val Thr Ser Pro Glu Trp 290 295 300 aca aag aaa ttt tat gag aaa gaa cct ccg att cca tat tgg tta gag 960 Thr Lys Lys Phe Tyr Glu Lys Glu Pro Pro Ile Pro Tyr Trp Leu Glu 305 310 315 320 aca cgt aaa gga aga aag gca tgg ttg caa aaa tat att cca gct gtt 1008 Thr Arg Lys Gly Arg Lys Ala Trp Leu Gln Lys Tyr Ile Pro Ala Val 325 330 335 cca cat gga agc aag tat aga ttg tat ttc aat act cct gat gga ccg 1056 Pro His Gly Ser Lys Tyr Arg Leu Tyr Phe Asn Thr Pro Asp Gly Pro 340 345 350 ctt gaa cga gtg cct gct tgg gct aca tat gtg 
caa cca gag gat gaa 1104 Leu Glu Arg Val Pro Ala Trp Ala Thr Tyr Val Gln Pro Glu Asp Glu 355 360 365 gga aag caa gct tac gca att cat tgg gaa cct tct cct gaa gct gca 1152 Gly Lys Gln Ala Tyr Ala Ile His Trp Glu Pro Ser Pro Glu Ala Ala 370 375 380 tac aag tgg aaa tat tcc aag cca aaa gtt cca gag tcc ttg cgc ata 1200 Tyr Lys Trp Lys Tyr Ser Lys Pro Lys Val Pro Glu Ser Leu Arg Ile 385 390 395 400 tac gaa tgt cat gtt gga atc agt ggg tcc gag cca aaa gtt tca act 1248 Tyr Glu Cys His Val Gly Ile Ser Gly Ser Glu Pro Lys Val Ser Thr 405 410 415 ttc gaa gaa ttt act aag aag gtc ctt cct cat gtg aaa aga gct gga 1296 Phe Glu Glu Phe Thr Lys Lys Val Leu Pro His Val Lys Arg Ala Gly 420 425 430 tac aat gca atc cag ttg att ggt gtc cct gag cac aag gat tat ttt 1344 Tyr Asn Ala Ile Gln Leu Ile Gly Val Pro Glu His Lys Asp Tyr Phe 435 440 445 act gtt ggt tat agg gtt acg aac ttc ttt gct gcc agt agc cga tat 1392 Thr Val Gly Tyr Arg Val Thr Asn Phe Phe Ala Ala Ser Ser Arg Tyr 450 455 460 ggc acg cca gat gat ttc aaa cgg ttg gtt gat gag gca cat ggc cta 1440 Gly Thr Pro Asp Asp Phe Lys Arg Leu Val Asp Glu Ala His Gly Leu 465 470 475 480 gga ctt ctt gtc ttt ttg gac atc gtg cat tct tat gca gca gct gat 1488 Gly Leu Leu Val Phe Leu Asp Ile Val His Ser Tyr Ala Ala Ala Asp 485 490 495 cag atg gtt ggg ctt tct ctc ttc gat ggt tca aat gat tgc tat ttt 1536 Gln Met Val Gly Leu Ser Leu Phe Asp Gly Ser Asn Asp Cys Tyr Phe 500 505 510 cat tat ggt aaa agg ggg cat cac aaa cac tgg ggc acc cgg atg ttc 1584 His Tyr Gly Lys Arg Gly His His Lys His Trp Gly Thr Arg Met Phe 515 520 525 aaa tac ggt gat ttg gat gtt ctt cat ttt cta ata tca aat ttg aac 1632 Lys Tyr Gly Asp Leu Asp Val Leu His Phe Leu Ile Ser Asn Leu Asn 530 535 540 tgg tgg att aca gag tat caa gtt gat ggt tac caa ttt cac tcg ctt 1680 Trp Trp Ile Thr Glu Tyr Gln Val Asp Gly Tyr Gln Phe His Ser Leu 545 550 555 560 gcc tcg atg att tac acg cac aat ggg ttt gct tca ttt aac aac gat 1728 Ala Ser Met Ile Tyr Thr His Asn Gly Phe Ala Ser Phe Asn Asn Asp 565 570 575 
ttg gat gac tat tgc aat cag tat gtt gac cga gat gct ctg atg tac 1776 Leu Asp Asp Tyr Cys Asn Gln Tyr Val Asp Arg Asp Ala Leu Met Tyr 580 585 590 ctc att ttg gcc aat gaa atc ctg cac gtt caa cat cca aat ata ata 1824 Leu Ile Leu Ala Asn Glu Ile Leu His Val Gln His Pro Asn Ile Ile 595 600 605 aca att gct gag gat gca aca tat tac cct ggg ttg tgt gag cca gtt 1872 Thr Ile Ala Glu Asp Ala Thr Tyr Tyr Pro Gly Leu Cys Glu Pro Val 610 615 620 tct caa ggt gga ctt gga ttt gat tat tat gtg aat ctt tct gcg tca 1920 Ser Gln Gly Gly Leu Gly Phe Asp Tyr Tyr Val Asn Leu Ser Ala Ser 625 630 635 640 gaa atg tgg gtt tct ctc ctc gac aat gta ccg gat aat gaa tgg agc 1968 Glu Met Trp Val Ser Leu Leu Asp Asn Val Pro Asp Asn Glu Trp Ser 645 650 655 atg agc aag att gtc agt aca ttg gtg gct aac aaa gag tat gca gac 2016 Met Ser Lys Ile Val Ser Thr Leu Val Ala Asn Lys Glu Tyr Ala Asp 660 665 670 aag atg ctc agc tat gcc gaa aat cac aac caa tcc ata tca gga ggg 2064 Lys Met Leu Ser Tyr Ala Glu Asn His Asn Gln Ser Ile Ser Gly Gly 675 680 685 cgt tca ttt gct gaa atc tta ttc ggt gga gtc gat aat ggt tct cct 2112 Arg Ser Phe Ala Glu Ile Leu Phe Gly Gly Val Asp Asn Gly Ser Pro 690 695 700 gga gga aaa gaa ttg cta gac agg gga att tca cta cat aag atg att 2160 Gly Gly Lys Glu Leu Leu Asp Arg Gly Ile Ser Leu His Lys Met Ile 705 710 715 720 aga ctg att act ttt aca agt ggt ggc cgt gca tac ctc aat ttc atg 2208 Arg Leu Ile Thr Phe Thr Ser Gly Gly Arg Ala Tyr Leu Asn Phe Met 725 730 735 gga aat gaa ttt gga cat cct gag agg gtt gag ttt cca acg cag agc 2256 Gly Asn Glu Phe Gly His Pro Glu Arg Val Glu Phe Pro Thr Gln Ser 740 745 750 aat aac ttc tcg ttt tca ctt gct aac cgc cgc tgg gac ctg ctg gaa 2304 Asn Asn Phe Ser Phe Ser Leu Ala Asn Arg Arg Trp Asp Leu Leu Glu 755 760 765 agt gga gtg cat cat cac ttg ttt tcc ttt gac aag gag cta atg gac 2352 Ser Gly Val His His His Leu Phe Ser Phe Asp Lys Glu Leu Met Asp 770 775 780 ttg gat aaa agc aag ggt atc ctt tca aga ggt ctg ccc agc atc cac 2400 Leu Asp Lys Ser Lys Gly Ile Leu Ser 
Arg Gly Leu Pro Ser Ile His 785 790 795 800 cat gtg aat gat gca aat atg gtg att tct ttc tca agg ggt cct ttc 2448 His Val Asn Asp Ala Asn Met Val Ile Ser Phe Ser Arg Gly Pro Phe 805 810 815 cta ttt atc ttt aac ttc cac cca tca aat tca tat gaa aag tat gat 2496 Leu Phe Ile Phe Asn Phe His Pro Ser Asn Ser Tyr Glu Lys Tyr Asp 820 825 830 gtc ggt gta gag gaa gct ggt gaa tat acc atg ata ctg aac tcc gat 2544 Val Gly Val Glu Glu Ala Gly Glu Tyr Thr Met Ile Leu Asn Ser Asp 835 840 845 gaa gta aag tat ggg ggt caa ggg att gta aca gaa gac cac tat ctt 2592 Glu Val Lys Tyr Gly Gly Gln Gly Ile Val Thr Glu Asp His Tyr Leu 850 855 860 caa cgg tca att agc aag aga att gat ggt caa aga aat tgc tta gag 2640 Gln Arg Ser Ile Ser Lys Arg Ile Asp Gly Gln Arg Asn Cys Leu Glu 865 870 875 880 gtg ttt ttg cct agc cgg act gcc caa gtt tac aag ttg acc cgg atc 2688 Val Phe Leu Pro Ser Arg Thr Ala Gln Val Tyr Lys Leu Thr Arg Ile 885 890 895 ctc cga ata tga 2700 Leu Arg Ile 6 899 PRT Arabidopsis thaliana 6 Met Val Ser Leu Ser Asn Gln Thr Arg Phe Ser Phe His Pro Asn Asn 1 5 10 15 Leu Val Val Ser Glu Lys Arg Arg Leu Gly Ile Ser Gly Val Asn Phe 20 25 30 Pro Arg Lys Ile Lys Leu Lys Ile Thr Cys Phe Ala Ala Glu Arg Pro 35 40 45 Arg Gln Glu Lys Gln Lys Lys Lys Ser Gln Ser Gln Ser Thr Ser Asp 50 55 60 Ala Glu Ala Gly Val Asp Pro Val Gly Phe Leu Thr Arg Leu Gly Ile 65 70 75 80 Ala Asp Arg Ile Phe Ala Gln Phe Leu Arg Glu Arg His Lys Ala Leu 85 90 95 Lys Asp Leu Lys Asp Glu Ile Phe Lys Arg His Phe Asp Phe Arg Asp 100 105 110 Phe Ala Ser Gly Phe Glu Leu Leu Gly Met His Arg His Met Glu His 115 120 125 Arg Val Asp Phe Met Asp Trp Gly Pro Gly Ser Arg Tyr Gly Ala Ile 130 135 140 Ile Gly Asp Phe Asn Gly Trp Ser Pro Thr Glu Asn Ala Ala Arg Glu 145 150 155 160 Gly Leu Phe Gly His Asp Asp Tyr Gly Tyr Trp Phe Ile Ile Leu Glu 165 170 175 Asp Lys Leu Arg Glu Gly Glu Glu Pro Asp Glu Leu Tyr Phe Gln Gln 180 185 190 Tyr Asn Tyr Val Asp Asp Tyr Asp Lys Gly Asp Ser Gly Val Ser Ala 195 200 205 Glu Glu Ile Phe Gln Lys Ala Asn Asp 
Glu Tyr Trp Glu Pro Gly Glu 210 215 220 Asp Arg Phe Ile Lys Asn Arg Phe Glu Val Pro Ala Lys Leu Tyr Glu 225 230 235 240 Gln Met Phe Gly Pro Asn Ser Pro Gln Thr Leu Glu Glu Leu Gly Asp 245 250 255 Ile Pro Asp Ala Glu Thr Arg Tyr Lys Gln Trp Lys Glu Glu His Lys 260 265 270 Asp Asp Pro Pro Ser Asn Leu Pro Pro Cys Asp Ile Ile Asp Lys Gly 275 280 285 Gln Gly Lys Pro Tyr Asp Ile Phe Asn Val Val Thr Ser Pro Glu Trp 290 295 300 Thr Lys Lys Phe Tyr Glu Lys Glu Pro Pro Ile Pro Tyr Trp Leu Glu 305 310 315 320 Thr Arg Lys Gly Arg Lys Ala Trp Leu Gln Lys Tyr Ile Pro Ala Val 325 330 335 Pro His Gly Ser Lys Tyr Arg Leu Tyr Phe Asn Thr Pro Asp Gly Pro 340 345 350 Leu Glu Arg Val Pro Ala Trp Ala Thr Tyr Val Gln Pro Glu Asp Glu 355 360 365 Gly Lys Gln Ala Tyr Ala Ile His Trp Glu Pro Ser Pro Glu Ala Ala 370 375 380 Tyr Lys Trp Lys Tyr Ser Lys Pro Lys Val Pro Glu Ser Leu Arg Ile 385 390 395 400 Tyr Glu Cys His Val Gly Ile Ser Gly Ser Glu Pro Lys Val Ser Thr 405 410 415 Phe Glu Glu Phe Thr Lys Lys Val Leu Pro His Val Lys Arg Ala Gly 420 425 430 Tyr Asn Ala Ile Gln Leu Ile Gly Val Pro Glu His Lys Asp Tyr Phe 435 440 445 Thr Val Gly Tyr Arg Val Thr Asn Phe Phe Ala Ala Ser Ser Arg Tyr 450 455 460 Gly Thr Pro Asp Asp Phe Lys Arg Leu Val Asp Glu Ala His Gly Leu 465 470 475 480 Gly Leu Leu Val Phe Leu Asp Ile Val His Ser Tyr Ala Ala Ala Asp 485 490 495 Gln Met Val Gly Leu Ser Leu Phe Asp Gly Ser Asn Asp Cys Tyr Phe 500 505 510 His Tyr Gly Lys Arg Gly His His Lys His Trp Gly Thr Arg Met Phe 515 520 525 Lys Tyr Gly Asp Leu Asp Val Leu His Phe Leu Ile Ser Asn Leu Asn 530 535 540 Trp Trp Ile Thr Glu Tyr Gln Val Asp Gly Tyr Gln Phe His Ser Leu 545 550 555 560 Ala Ser Met Ile Tyr Thr His Asn Gly Phe Ala Ser Phe Asn Asn Asp 565 570 575 Leu Asp Asp Tyr Cys Asn Gln Tyr Val Asp Arg Asp Ala Leu Met Tyr 580 585 590 Leu Ile Leu Ala Asn Glu Ile Leu His Val Gln His Pro Asn Ile Ile 595 600 605 Thr Ile Ala Glu Asp Ala Thr Tyr Tyr Pro Gly Leu Cys Glu Pro Val 610 615 620 Ser Gln Gly Gly Leu Gly Phe Asp Tyr Tyr 
Val Asn Leu Ser Ala Ser 625 630 635 640 Glu Met Trp Val Ser Leu Leu Asp Asn Val Pro Asp Asn Glu Trp Ser 645 650 655 Met Ser Lys Ile Val Ser Thr Leu Val Ala Asn Lys Glu Tyr Ala Asp 660 665 670 Lys Met Leu Ser Tyr Ala Glu Asn His Asn Gln Ser Ile Ser Gly Gly 675 680 685 Arg Ser Phe Ala Glu Ile Leu Phe Gly Gly Val Asp Asn Gly Ser Pro 690 695 700 Gly Gly Lys Glu Leu Leu Asp Arg Gly Ile Ser Leu His Lys Met Ile 705 710 715 720 Arg Leu Ile Thr Phe Thr Ser Gly Gly Arg Ala Tyr Leu Asn Phe Met 725 730 735 Gly Asn Glu Phe Gly His Pro Glu Arg Val Glu Phe Pro Thr Gln Ser 740 745 750 Asn Asn Phe Ser Phe Ser Leu Ala Asn Arg Arg Trp Asp Leu Leu Glu 755 760 765 Ser Gly Val His His His Leu Phe Ser Phe Asp Lys Glu Leu Met Asp 770 775 780 Leu Asp Lys Ser Lys Gly Ile Leu Ser Arg Gly Leu Pro Ser Ile His 785 790 795 800 His Val Asn Asp Ala Asn Met Val Ile Ser Phe Ser Arg Gly Pro Phe 805 810 815 Leu Phe Ile Phe Asn Phe His Pro Ser Asn Ser Tyr Glu Lys Tyr Asp 820 825 830 Val Gly Val Glu Glu Ala Gly Glu Tyr Thr Met Ile Leu Asn Ser Asp 835 840 845 Glu Val Lys Tyr Gly Gly Gln Gly Ile Val Thr Glu Asp His Tyr Leu 850 855 860 Gln Arg Ser Ile Ser Lys Arg Ile Asp Gly Gln Arg Asn Cys Leu Glu 865 870 875 880 Val Phe Leu Pro Ser Arg Thr Ala Gln Val Tyr Lys Leu Thr Arg Ile 885 890 895 Leu Arg Ile 7 903 DNA Arabidopsis thaliana CDS (1)..(903) 7 atg gtg cta ctt ctc cga tat cga tcc tta acg ata aat ctc act cct 48 Met Val Leu Leu Leu Arg Tyr Arg Ser Leu Thr Ile Asn Leu Thr Pro 1 5 10 15 cta ata ccc aaa tct caa aaa ttc cac act ctc caa tca ttt cgt aac 96 Leu Ile Pro Lys Ser Gln Lys Phe His Thr Leu Gln Ser Phe Arg Asn 20 25 30 cct aat ttc atc tcc atc cca aaa atc tca gcc tcc acg aac aat cca 144 Pro Asn Phe Ile Ser Ile Pro Lys Ile Ser Ala Ser Thr Asn Asn Pro 35 40 45 aca aca aca acg aat cgg tcg atc tcc gac gcg act aaa ttc gct aag 192 Thr Thr Thr Thr Asn Arg Ser Ile Ser Asp Ala Thr Lys Phe Ala Lys 50 55 60 tca gtg tta ttc ata cct cct ggt gta gag att gag gaa cta aca gat 240 Ser Val Leu Phe Ile Pro Pro Gly Val Glu 
Ile Glu Glu Leu Thr Asp 65 70 75 80 gat atg gtt ctt ccg ggt tca aat atc gta atc gga cct ttt gcg ggt 288 Asp Met Val Leu Pro Gly Ser Asn Ile Val Ile Gly Pro Phe Ala Gly 85 90 95 cat tca caa atc aaa gaa gtt gaa ttt gtt aaa agt agt gct cgt gct 336 His Ser Gln Ile Lys Glu Val Glu Phe Val Lys Ser Ser Ala Arg Ala 100 105 110 aga gat tgt cct aaa gat gat cgt cct gag att gcg att ttg ggt cgt 384 Arg Asp Cys Pro Lys Asp Asp Arg Pro Glu Ile Ala Ile Leu Gly Arg 115 120 125 tct aat gtt ggc aag tct tcg ctt atc aat tgt ttg gtt cgt aag aaa 432 Ser Asn Val Gly Lys Ser Ser Leu Ile Asn Cys Leu Val Arg Lys Lys 130 135 140 gaa gtt gct ctt act tct aag aaa cct ggg aag act cag ctt ata aat 480 Glu Val Ala Leu Thr Ser Lys Lys Pro Gly Lys Thr Gln Leu Ile Asn 145 150 155 160 cac ttc ttg gtg aat aag agt tgg tac att gtg gat ttg cct ggt tat 528 His Phe Leu Val Asn Lys Ser Trp Tyr Ile Val Asp Leu Pro Gly Tyr 165 170 175 gga ttt gct aaa gtt tca gat gct gcg aaa aca gat tgg tct gcg ttt 576 Gly Phe Ala Lys Val Ser Asp Ala Ala Lys Thr Asp Trp Ser Ala Phe 180 185 190 act aaa ggt tac ttc ttg aat aga gac agt ctt gtt tgt gtt cta ctt 624 Thr Lys Gly Tyr Phe Leu Asn Arg Asp Ser Leu Val Cys Val Leu Leu 195 200 205 ctc atc gat gca agt gtt cct cct cag aag att gat ctt gat tgt gct 672 Leu Ile Asp Ala Ser Val Pro Pro Gln Lys Ile Asp Leu Asp Cys Ala 210 215 220 aat tgg ctt ggt cgc aac aat gta ccg atg act ttc gtg ttt acg aaa 720 Asn Trp Leu Gly Arg Asn Asn Val Pro Met Thr Phe Val Phe Thr Lys 225 230 235 240 tgc gat aaa atg aaa gca acg aaa ggg aaa aga cct gat gag aac att 768 Cys Asp Lys Met Lys Ala Thr Lys Gly Lys Arg Pro Asp Glu Asn Ile 245 250 255 aaa gct ttc caa caa atc att aga gag aat ttt aaa gta cat cct cct 816 Lys Ala Phe Gln Gln Ile Ile Arg Glu Asn Phe Lys Val His Pro Pro 260 265 270 tgg att ttg acg agt agt gta tct ggt tta ggc aga gac gag ctt ctt 864 Trp Ile Leu Thr Ser Ser Val Ser Gly Leu Gly Arg Asp Glu Leu Leu 275 280 285 ctt cac atg tcg cag ctg aga aac tac tgg gat caa tag 903 Leu His Met Ser Gln Leu Arg 
Asn Tyr Trp Asp Gln 290 295 300 8 300 PRT Arabidopsis thaliana 8 Met Val Leu Leu Leu Arg Tyr Arg Ser Leu Thr Ile Asn Leu Thr Pro 1 5 10 15 Leu Ile Pro Lys Ser Gln Lys Phe His Thr Leu Gln Ser Phe Arg Asn 20 25 30 Pro Asn Phe Ile Ser Ile Pro Lys Ile Ser Ala Ser Thr Asn Asn Pro 35 40 45 Thr Thr Thr Thr Asn Arg Ser Ile Ser Asp Ala Thr Lys Phe Ala Lys 50 55 60 Ser Val Leu Phe Ile Pro Pro Gly Val Glu Ile Glu Glu Leu Thr Asp 65 70 75 80 Asp Met Val Leu Pro Gly Ser Asn Ile Val Ile Gly Pro Phe Ala Gly 85 90 95 His Ser Gln Ile Lys Glu Val Glu Phe Val Lys Ser Ser Ala Arg Ala 100 105 110 Arg Asp Cys Pro Lys Asp Asp Arg Pro Glu Ile Ala Ile Leu Gly Arg 115 120 125 Ser Asn Val Gly Lys Ser Ser Leu Ile Asn Cys Leu Val Arg Lys Lys 130 135 140 Glu Val Ala Leu Thr Ser Lys Lys Pro Gly Lys Thr Gln Leu Ile Asn 145 150 155 160 His Phe Leu Val Asn Lys Ser Trp Tyr Ile Val Asp Leu Pro Gly Tyr 165 170 175 Gly Phe Ala Lys Val Ser Asp Ala Ala Lys Thr Asp Trp Ser Ala Phe 180 185 190 Thr Lys Gly Tyr Phe Leu Asn Arg Asp Ser Leu Val Cys Val Leu Leu 195 200 205 Leu Ile Asp Ala Ser Val Pro Pro Gln Lys Ile Asp Leu Asp Cys Ala 210 215 220 Asn Trp Leu Gly Arg Asn Asn Val Pro Met Thr Phe Val Phe Thr Lys 225 230 235 240 Cys Asp Lys Met Lys Ala Thr Lys Gly Lys Arg Pro Asp Glu Asn Ile 245 250 255 Lys Ala Phe Gln Gln Ile Ile Arg Glu Asn Phe Lys Val His Pro Pro 260 265 270 Trp Ile Leu Thr Ser Ser Val Ser Gly Leu Gly Arg Asp Glu Leu Leu 275 280 285 Leu His Met Ser Gln Leu Arg Asn Tyr Trp Asp Gln 290 295 300 9 2412 DNA Arabidopsis thaliana CDS (1)..(2412) 9 atg atg tcg gtg aga gca atc aat ggt tgc tcc atc att cgc acc gcc 48 Met Met Ser Val Arg Ala Ile Asn Gly Cys Ser Ile Ile Arg Thr Ala 1 5 10 15 act tcc gcc ggc ggt cct ccc gtc tct cta ttc cgc cac cgt att cag 96 Thr Ser Ala Gly Gly Pro Pro Val Ser Leu Phe Arg His Arg Ile Gln 20 25 30 cgc ctc aga gct tct cat ctc cgt gaa ttc tcg aaa tta cgt tta aat 144 Arg Leu Arg Ala Ser His Leu Arg Glu Phe Ser Lys Leu Arg Leu Asn 35 40 45 ttt cca cta atc cgc gcg gat agg cgg ttt 
ctc gga aat agt gat gct 192 Phe Pro Leu Ile Arg Ala Asp Arg Arg Phe Leu Gly Asn Ser Asp Ala 50 55 60 ccg agc tgt tcg act tgc ata cat agt ctg gtc gaa agt gtt tcc gaa 240 Pro Ser Cys Ser Thr Cys Ile His Ser Leu Val Glu Ser Val Ser Glu 65 70 75 80 gag ctc gaa tcc atc agc aga cgc aag gga agc cga atg aga gtt cgc 288 Glu Leu Glu Ser Ile Ser Arg Arg Lys Gly Ser Arg Met Arg Val Arg 85 90 95 gcc agt gta aag gtg aaa tta aca agc tac gga gag gtt cta gag gat 336 Ala Ser Val Lys Val Lys Leu Thr Ser Tyr Gly Glu Val Leu Glu Asp 100 105 110 aag cta gta aac cag gag cta gag gct ggt ttg ctt ctg gag ttt aag 384 Lys Leu Val Asn Gln Glu Leu Glu Ala Gly Leu Leu Leu Glu Phe Lys 115 120 125 aag gac gca gac aga gtg ttg ttg gca gtt ctt cat aga cgt gat ggc 432 Lys Asp Ala Asp Arg Val Leu Leu Ala Val Leu His Arg Arg Asp Gly 130 135 140 aag aag aat tgg atg gtg ttt gat cag aat ggt gtt agt tgt tcc atc 480 Lys Lys Asn Trp Met Val Phe Asp Gln Asn Gly Val Ser Cys Ser Ile 145 150 155 160 aaa cca cag caa att aca tac atc gtc ccg aat gtt tat aat ttt gat 528 Lys Pro Gln Gln Ile Thr Tyr Ile Val Pro Asn Val Tyr Asn Phe Asp 165 170 175 cat aca gga ctt act gat ttt ctc caa agg gct caa gat aac ttg gat 576 His Thr Gly Leu Thr Asp Phe Leu Gln Arg Ala Gln Asp Asn Leu Asp 180 185 190 ccc caa ctg ctt gag ttt gct tgg atg gag tta ctt gag aag aac aaa 624 Pro Gln Leu Leu Glu Phe Ala Trp Met Glu Leu Leu Glu Lys Asn Lys 195 200 205 cct gtg acc cca gaa gag tta gca gag atg ata tat ggt cgc gca gat 672 Pro Val Thr Pro Glu Glu Leu Ala Glu Met Ile Tyr Gly Arg Ala Asp 210 215 220 cct ctt gaa agc tat tgt gct cat ttt ttg cta tca caa gac gag ata 720 Pro Leu Glu Ser Tyr Cys Ala His Phe Leu Leu Ser Gln Asp Glu Ile 225 230 235 240 tac ttt tct atc ttg gag tca aaa ggt tct cga tca ata tat agc cct 768 Tyr Phe Ser Ile Leu Glu Ser Lys Gly Ser Arg Ser Ile Tyr Ser Pro 245 250 255 cgg cct act gaa cag gtg gaa gag ctt tta cga agg caa cgc gtg aag 816 Arg Pro Thr Glu Gln Val Glu Glu Leu Leu Arg Arg Gln Arg Val Lys 260 265 270 gag gca gaa gat aaa 
gag ttt cag gag ttt att caa ttg ctg aag tct 864 Glu Ala Glu Asp Lys Glu Phe Gln Glu Phe Ile Gln Leu Leu Lys Ser 275 280 285 gcc aag aag gca cct agt cac gcc aag ccc cct aaa tct tca tgg cta 912 Ala Lys Lys Ala Pro Ser His Ala Lys Pro Pro Lys Ser Ser Trp Leu 290 295 300 gct gat gat aaa gtc caa gat agg att ggc tca ctt gaa gct tat gcc 960 Ala Asp Asp Lys Val Gln Asp Arg Ile Gly Ser Leu Glu Ala Tyr Ala 305 310 315 320 att gat gca tgg gca agc act gac cag cag aaa ttg gct gga acg ata 1008 Ile Asp Ala Trp Ala Ser Thr Asp Gln Gln Lys Leu Ala Gly Thr Ile 325 330 335 tta aaa tcc atg ggg tta cag aaa acg tct gta tca gca cta aac ctt 1056 Leu Lys Ser Met Gly Leu Gln Lys Thr Ser Val Ser Ala Leu Asn Leu 340 345 350 ctc ata gat att gga tat ttt cct gta cat gtc aat ctt gaa ctg ctt 1104 Leu Ile Asp Ile Gly Tyr Phe Pro Val His Val Asn Leu Glu Leu Leu 355 360 365 aag ctg aat ctt cca acc cac cac tca gaa gct atc act gaa gct gct 1152 Lys Leu Asn Leu Pro Thr His His Ser Glu Ala Ile Thr Glu Ala Ala 370 375 380 gaa gct ctt ctt tca gaa tcg tct gat att gat gcg gtt agg agg att 1200 Glu Ala Leu Leu Ser Glu Ser Ser Asp Ile Asp Ala Val Arg Arg Ile 385 390 395 400 gat ctc aca cac ttg aag gtt tat gca att gat gtt gat gag gct gat 1248 Asp Leu Thr His Leu Lys Val Tyr Ala Ile Asp Val Asp Glu Ala Asp 405 410 415 gag ctt gat gat gcc ctc agt gca acg aga ttg cag gat gga cgg ata 1296 Glu Leu Asp Asp Ala Leu Ser Ala Thr Arg Leu Gln Asp Gly Arg Ile 420 425 430 aag atc tgg ata cat gtt gct gat cct gct aga tat gtt aca cct ggg 1344 Lys Ile Trp Ile His Val Ala Asp Pro Ala Arg Tyr Val Thr Pro Gly 435 440 445 agt aaa gtg gac aga gag gca agg aga agg gga act tct gtc ttt ctg 1392 Ser Lys Val Asp Arg Glu Ala Arg Arg Arg Gly Thr Ser Val Phe Leu 450 455 460 cca acg gct act tat cca atg ttt cca gag aaa ctt gcc atg gaa gga 1440 Pro Thr Ala Thr Tyr Pro Met Phe Pro Glu Lys Leu Ala Met Glu Gly 465 470 475 480 atg agc tta aga caa gga gaa aac tgt aat gct gtc tct gtg tct gtt 1488 Met Ser Leu Arg Gln Gly Glu Asn Cys Asn Ala Val Ser Val 
Ser Val 485 490 495 gtc ctg cgt tct gat ggc tgt att aca gaa tat tca gta gat aat tca 1536 Val Leu Arg Ser Asp Gly Cys Ile Thr Glu Tyr Ser Val Asp Asn Ser 500 505 510 atc atc aga ccg acc tat atg ttg acg tat gaa agt gct tct gag ctg 1584 Ile Ile Arg Pro Thr Tyr Met Leu Thr Tyr Glu Ser Ala Ser Glu Leu 515 520 525 ctt cac ttg aac cta gaa gaa gaa gct gaa ctg aaa ttg ctg tct gag 1632 Leu His Leu Asn Leu Glu Glu Glu Ala Glu Leu Lys Leu Leu Ser Glu 530 535 540 gca gca ttc att cgc tct cag tgg cgc cgt gaa cag ggt gca gtg gac 1680 Ala Ala Phe Ile Arg Ser Gln Trp Arg Arg Glu Gln Gly Ala Val Asp 545 550 555 560 aca act aca cta gaa act cgt atc aaa gtg gtg aat cct gag gat cca 1728 Thr Thr Thr Leu Glu Thr Arg Ile Lys Val Val Asn Pro Glu Asp Pro 565 570 575 gaa cct ttg ata aat ctc tat gta gaa aat cag gca gat cta gcg atg 1776 Glu Pro Leu Ile Asn Leu Tyr Val Glu Asn Gln Ala Asp Leu Ala Met 580 585 590 aga ctt gtc ttt gaa atg atg att ctt tgt ggg gag gtt gta gca acg 1824 Arg Leu Val Phe Glu Met Met Ile Leu Cys Gly Glu Val Val Ala Thr 595 600 605 ttt ggg tct cag cat aat att cca ttg cca tat aga gga cag cca cag 1872 Phe Gly Ser Gln His Asn Ile Pro Leu Pro Tyr Arg Gly Gln Pro Gln 610 615 620 tca aat att gat gta tct gca ttt gcg cat ctt cca gaa gga cca gta 1920 Ser Asn Ile Asp Val Ser Ala Phe Ala His Leu Pro Glu Gly Pro Val 625 630 635 640 cgt tcc tca tcc att gtc aaa gta atg cgt gct gca gaa atg aat ttc 1968 Arg Ser Ser Ser Ile Val Lys Val Met Arg Ala Ala Glu Met Asn Phe 645 650 655 aga tgt cct gtt cga cat gga gtg ctt gga att cct ggt tac gtt cag 2016 Arg Cys Pro Val Arg His Gly Val Leu Gly Ile Pro Gly Tyr Val Gln 660 665 670 ttc acg tct ccc att cgt cga tac atg gat ctt act gca cac tat cag 2064 Phe Thr Ser Pro Ile Arg Arg Tyr Met Asp Leu Thr Ala His Tyr Gln 675 680 685 att aaa gct ttt ctt cga gga ggt gac aat ttt cca ttc tct gct ggt 2112 Ile Lys Ala Phe Leu Arg Gly Gly Asp Asn Phe Pro Phe Ser Ala Gly 690 695 700 gag tta gaa ggt ata gca gca tcc gta aac atg cag agc aaa gtg gtg 2160 Glu Leu Glu Gly 
Ile Ala Ala Ser Val Asn Met Gln Ser Lys Val Val 705 710 715 720 aga aaa ctc tca aac acc ggt ctc cgc tat tgg gta ata gag ttt ctc 2208 Arg Lys Leu Ser Asn Thr Gly Leu Arg Tyr Trp Val Ile Glu Phe Leu 725 730 735 aga aga cag gaa aag ggt aaa aag tac acc gca ttg gtc ttg aga ttt 2256 Arg Arg Gln Glu Lys Gly Lys Lys Tyr Thr Ala Leu Val Leu Arg Phe 740 745 750 gtc aaa gac cgg atc gca tcc ctt ttg ttg gtt gag gtg ggt ttt caa 2304 Val Lys Asp Arg Ile Ala Ser Leu Leu Leu Val Glu Val Gly Phe Gln 755 760 765 gca acg gcg tgg gtg tca gaa gga aaa caa gtg gga gat gaa atc gaa 2352 Ala Thr Ala Trp Val Ser Glu Gly Lys Gln Val Gly Asp Glu Ile Glu 770 775 780 gtt cga gta gaa gaa gct cat ccc cgc gac gat ctt att ctc ttc aaa 2400 Val Arg Val Glu Glu Ala His Pro Arg Asp Asp Leu Ile Leu Phe Lys 785 790 795 800 gag gtt atc tag 2412 Glu Val Ile 10 803 PRT Arabidopsis thaliana 10 Met Met Ser Val Arg Ala Ile Asn Gly Cys Ser Ile Ile Arg Thr Ala 1 5 10 15 Thr Ser Ala Gly Gly Pro Pro Val Ser Leu Phe Arg His Arg Ile Gln 20 25 30 Arg Leu Arg Ala Ser His Leu Arg Glu Phe Ser Lys Leu Arg Leu Asn 35 40 45 Phe Pro Leu Ile Arg Ala Asp Arg Arg Phe Leu Gly Asn Ser Asp Ala 50 55 60 Pro Ser Cys Ser Thr Cys Ile His Ser Leu Val Glu Ser Val Ser Glu 65 70 75 80 Glu Leu Glu Ser Ile Ser Arg Arg Lys Gly Ser Arg Met Arg Val Arg 85 90 95 Ala Ser Val Lys Val Lys Leu Thr Ser Tyr Gly Glu Val Leu Glu Asp 100 105 110 Lys Leu Val Asn Gln Glu Leu Glu Ala Gly Leu Leu Leu Glu Phe Lys 115 120 125 Lys Asp Ala Asp Arg Val Leu Leu Ala Val Leu His Arg Arg Asp Gly 130 135 140 Lys Lys Asn Trp Met Val Phe Asp Gln Asn Gly Val Ser Cys Ser Ile 145 150 155 160 Lys Pro Gln Gln Ile Thr Tyr Ile Val Pro Asn Val Tyr Asn Phe Asp 165 170 175 His Thr Gly Leu Thr Asp Phe Leu Gln Arg Ala Gln Asp Asn Leu Asp 180 185 190 Pro Gln Leu Leu Glu Phe Ala Trp Met Glu Leu Leu Glu Lys Asn Lys 195 200 205 Pro Val Thr Pro Glu Glu Leu Ala Glu Met Ile Tyr Gly Arg Ala Asp 210 215 220 Pro Leu Glu Ser Tyr Cys Ala His Phe Leu Leu Ser Gln Asp Glu Ile 225 230 235 240 Tyr 
Phe Ser Ile Leu Glu Ser Lys Gly Ser Arg Ser Ile Tyr Ser Pro 245 250 255 Arg Pro Thr Glu Gln Val Glu Glu Leu Leu Arg Arg Gln Arg Val Lys 260 265 270 Glu Ala Glu Asp Lys Glu Phe Gln Glu Phe Ile Gln Leu Leu Lys Ser 275 280 285 Ala Lys Lys Ala Pro Ser His Ala Lys Pro Pro Lys Ser Ser Trp Leu 290 295 300 Ala Asp Asp Lys Val Gln Asp Arg Ile Gly Ser Leu Glu Ala Tyr Ala 305 310 315 320 Ile Asp Ala Trp Ala Ser Thr Asp Gln Gln Lys Leu Ala Gly Thr Ile 325 330 335 Leu Lys Ser Met Gly Leu Gln Lys Thr Ser Val Ser Ala Leu Asn Leu 340 345 350 Leu Ile Asp Ile Gly Tyr Phe Pro Val His Val Asn Leu Glu Leu Leu 355 360 365 Lys Leu Asn Leu Pro Thr His His Ser Glu Ala Ile Thr Glu Ala Ala 370 375 380 Glu Ala Leu Leu Ser Glu Ser Ser Asp Ile Asp Ala Val Arg Arg Ile 385 390 395 400 Asp Leu Thr His Leu Lys Val Tyr Ala Ile Asp Val Asp Glu Ala Asp 405 410 415 Glu Leu Asp Asp Ala Leu Ser Ala Thr Arg Leu Gln Asp Gly Arg Ile 420 425 430 Lys Ile Trp Ile His Val Ala Asp Pro Ala Arg Tyr Val Thr Pro Gly 435 440 445 Ser Lys Val Asp Arg Glu Ala Arg Arg Arg Gly Thr Ser Val Phe Leu 450 455 460 Pro Thr Ala Thr Tyr Pro Met Phe Pro Glu Lys Leu Ala Met Glu Gly 465 470 475 480 Met Ser Leu Arg Gln Gly Glu Asn Cys Asn Ala Val Ser Val Ser Val 485 490 495 Val Leu Arg Ser Asp Gly Cys Ile Thr Glu Tyr Ser Val Asp Asn Ser 500 505 510 Ile Ile Arg Pro Thr Tyr Met Leu Thr Tyr Glu Ser Ala Ser Glu Leu 515 520 525 Leu His Leu Asn Leu Glu Glu Glu Ala Glu Leu Lys Leu Leu Ser Glu 530 535 540 Ala Ala Phe Ile Arg Ser Gln Trp Arg Arg Glu Gln Gly Ala Val Asp 545 550 555 560 Thr Thr Thr Leu Glu Thr Arg Ile Lys Val Val Asn Pro Glu Asp Pro 565 570 575 Glu Pro Leu Ile Asn Leu Tyr Val Glu Asn Gln Ala Asp Leu Ala Met 580 585 590 Arg Leu Val Phe Glu Met Met Ile Leu Cys Gly Glu Val Val Ala Thr 595 600 605 Phe Gly Ser Gln His Asn Ile Pro Leu Pro Tyr Arg Gly Gln Pro Gln 610 615 620 Ser Asn Ile Asp Val Ser Ala Phe Ala His Leu Pro Glu Gly Pro Val 625 630 635 640 Arg Ser Ser Ser Ile Val Lys Val Met Arg Ala Ala Glu Met Asn Phe 645 650 655 Arg Cys 
Pro Val Arg His Gly Val Leu Gly Ile Pro Gly Tyr Val Gln 660 665 670 Phe Thr Ser Pro Ile Arg Arg Tyr Met Asp Leu Thr Ala His Tyr Gln 675 680 685 Ile Lys Ala Phe Leu Arg Gly Gly Asp Asn Phe Pro Phe Ser Ala Gly 690 695 700 Glu Leu Glu Gly Ile Ala Ala Ser Val Asn Met Gln Ser Lys Val Val 705 710 715 720 Arg Lys Leu Ser Asn Thr Gly Leu Arg Tyr Trp Val Ile Glu Phe Leu 725 730 735 Arg Arg Gln Glu Lys Gly Lys Lys Tyr Thr Ala Leu Val Leu Arg Phe 740 745 750 Val Lys Asp Arg Ile Ala Ser Leu Leu Leu Val Glu Val Gly Phe Gln 755 760 765 Ala Thr Ala Trp Val Ser Glu Gly Lys Gln Val Gly Asp Glu Ile Glu 770 775 780 Val Arg Val Glu Glu Ala His Pro Arg Asp Asp Leu Ile Leu Phe Lys 785 790 795 800 Glu Val Ile 11 3888 DNA Arabidopsis thaliana CDS (1)..(3888) 11 atg acg cag tcg cct gtt caa ccg cct ccg ccg ctc cca gct cag ccg 48 Met Thr Gln Ser Pro Val Gln Pro Pro Pro Pro Leu Pro Ala Gln Pro 1 5 10 15 cat tcc gcc gcc gga ggt gta ata aga ggc gat gta cag gtt agg tgc 96 His Ser Ala Ala Gly Gly Val Ile Arg Gly Asp Val Gln Val Arg Cys 20 25 30 gca ggc tgt cgc gtg att ctg agg gtg aaa acc ggc gtt gtg gag ttc 144 Ala Gly Cys Arg Val Ile Leu Arg Val Lys Thr Gly Val Val Glu Phe 35 40 45 tcg tgt ccg act tgt caa ctt ccc caa atg ttg ccg ccg gag ctt ctt 192 Ser Cys Pro Thr Cys Gln Leu Pro Gln Met Leu Pro Pro Glu Leu Leu 50 55 60 tct agg gca cgg ccg cag ttt ccg cag tct cca caa caa cca cca cag 240 Ser Arg Ala Arg Pro Gln Phe Pro Gln Ser Pro Gln Gln Pro Pro Gln 65 70 75 80 cca att cag acg ctt ccg cca ccg att caa cag cag ctt aag cct ctt 288 Pro Ile Gln Thr Leu Pro Pro Pro Ile Gln Gln Gln Leu Lys Pro Leu 85 90 95 aat cta cct agg cct ccg gtt ccg gca cat gga atc gat ccg acg aag 336 Asn Leu Pro Arg Pro Pro Val Pro Ala His Gly Ile Asp Pro Thr Lys 100 105 110 atg caa ttg cct tgc gct aac tgt cag gct atc ctc aat gtg cct cat 384 Met Gln Leu Pro Cys Ala Asn Cys Gln Ala Ile Leu Asn Val Pro His 115 120 125 gga ctc act cgg ttc tcg tgt cct cag tgc cat gtc gag ctc gct gtt 432 Gly Leu Thr Arg Phe Ser Cys Pro Gln Cys His Val 
Glu Leu Ala Val 130 135 140 gat gtc tcc aag ctt aat cgg tct tta act gct tct caa tct cat tct 480 Asp Val Ser Lys Leu Asn Arg Ser Leu Thr Ala Ser Gln Ser His Ser 145 150 155 160 aat cct cct act cct gct gct ccc aca gtc ccg cct ccg cct cct ccc 528 Asn Pro Pro Thr Pro Ala Ala Pro Thr Val Pro Pro Pro Pro Pro Pro 165 170 175 gaa gaa gtc aat gag gaa gct att gaa gtg gag aga gaa gaa gat gaa 576 Glu Glu Val Asn Glu Glu Ala Ile Glu Val Glu Arg Glu Glu Asp Glu 180 185 190 ggt gga acg gct gga gaa acg ttt atg gat tat cgt ccc cca aaa ctt 624 Gly Gly Thr Ala Gly Glu Thr Phe Met Asp Tyr Arg Pro Pro Lys Leu 195 200 205 tcg atc gga cct cct cat cct gat cca att gtg gaa aca tca tct cta 672 Ser Ile Gly Pro Pro His Pro Asp Pro Ile Val Glu Thr Ser Ser Leu 210 215 220 tct gca gtg caa ccc cca gag cct act tat gat ctc aaa atc aag gaa 720 Ser Ala Val Gln Pro Pro Glu Pro Thr Tyr Asp Leu Lys Ile Lys Glu 225 230 235 240 gag ctt gaa agg tca aag gct tta tcg tgt ttg cag att gag aca tta 768 Glu Leu Glu Arg Ser Lys Ala Leu Ser Cys Leu Gln Ile Glu Thr Leu 245 250 255 gtt tac gct tgc cag agg cat ctc cag cac ctt gct gat ggt acc aga 816 Val Tyr Ala Cys Gln Arg His Leu Gln His Leu Ala Asp Gly Thr Arg 260 265 270 gca ggc ttt ttt gtt gga gat ggt gct ggt gtg ggc aaa ggg cga aca 864 Ala Gly Phe Phe Val Gly Asp Gly Ala Gly Val Gly Lys Gly Arg Thr 275 280 285 att gcc ggg ttg att tgg gaa aat tgg aaa cat gga agg agg aaa gct 912 Ile Ala Gly Leu Ile Trp Glu Asn Trp Lys His Gly Arg Arg Lys Ala 290 295 300 ttg tgg att tct att ggc tca gac ttg aaa tat gat gca aga agg gac 960 Leu Trp Ile Ser Ile Gly Ser Asp Leu Lys Tyr Asp Ala Arg Arg Asp 305 310 315 320 ttg gat gat gtg ggt gct aca tgc gtt gga gtg aat cct ttg aac aag 1008 Leu Asp Asp Val Gly Ala Thr Cys Val Gly Val Asn Pro Leu Asn Lys 325 330 335 cta ccc tat tct aag ctt gac tcg aaa aat gtt ggt att aaa gaa gga 1056 Leu Pro Tyr Ser Lys Leu Asp Ser Lys Asn Val Gly Ile Lys Glu Gly 340 345 350 gtg gta ttc ttg aca tac aat agc ctc att gct tcc tct gag aag ggc 1104 Val Val Phe Leu 
Thr Tyr Asn Ser Leu Ile Ala Ser Ser Glu Lys Gly 355 360 365 cgt tct cgt tta cag cag ctt gtt caa tgg tgt gga cca gaa ttt gat 1152 Arg Ser Arg Leu Gln Gln Leu Val Gln Trp Cys Gly Pro Glu Phe Asp 370 375 380 ggt ctc ttg atc ttt gat gag tgc cat aaa gcg aaa aat ttg gta cct 1200 Gly Leu Leu Ile Phe Asp Glu Cys His Lys Ala Lys Asn Leu Val Pro 385 390 395 400 gaa gct ggg agt cag ccg aca cgc atc gga cag gca gtt gtt gac att 1248 Glu Ala Gly Ser Gln Pro Thr Arg Ile Gly Gln Ala Val Val Asp Ile 405 410 415 cag gat aag atc ccc caa gca cgt gtt att tat tgc tcg gct act ggg 1296 Gln Asp Lys Ile Pro Gln Ala Arg Val Ile Tyr Cys Ser Ala Thr Gly 420 425 430 gca tca gaa cca cga aac atg ggt tat atg gtc agg ctt ggt ctt tgg 1344 Ala Ser Glu Pro Arg Asn Met Gly Tyr Met Val Arg Leu Gly Leu Trp 435 440 445 gga gct gga aca agt ttt agt gat ttt aac aaa ttc cta ggt gcc ctt 1392 Gly Ala Gly Thr Ser Phe Ser Asp Phe Asn Lys Phe Leu Gly Ala Leu 450 455 460 gat aaa ggt ggg aca gga gcg ttg gaa ttg gtt gcc atg gat atg aaa 1440 Asp Lys Gly Gly Thr Gly Ala Leu Glu Leu Val Ala Met Asp Met Lys 465 470 475 480 gcg agg ggg atg tat gta tgc cga acc ctg agt tac aaa ggg gct gag 1488 Ala Arg Gly Met Tyr Val Cys Arg Thr Leu Ser Tyr Lys Gly Ala Glu 485 490 495 ttt gag att gtt gaa gct cgt tta gaa gca gga atg gag gca atg tac 1536 Phe Glu Ile Val Glu Ala Arg Leu Glu Ala Gly Met Glu Ala Met Tyr 500 505 510 aat aag tct gca gaa ttt tgg gca gag cta cgg ata gag ttg ttg tca 1584 Asn Lys Ser Ala Glu Phe Trp Ala Glu Leu Arg Ile Glu Leu Leu Ser 515 520 525 gca agt gct ttt ctc cct aac gag aaa cca aat tct agt cag ttg tgg 1632 Ala Ser Ala Phe Leu Pro Asn Glu Lys Pro Asn Ser Ser Gln Leu Trp 530 535 540 aga tta tac tgg tca agc cac cag cgt ttc ttt aga cac ctg tgt atg 1680 Arg Leu Tyr Trp Ser Ser His Gln Arg Phe Phe Arg His Leu Cys Met 545 550 555 560 tca gct aaa gtc cct gta act gtg agg tta gct aag aaa gcc ttg tca 1728 Ser Ala Lys Val Pro Val Thr Val Arg Leu Ala Lys Lys Ala Leu Ser 565 570 575 aca aac aag tgt gta gtt att ggc ctt cag 
agt act gga gag gct cga 1776 Thr Asn Lys Cys Val Val Ile Gly Leu Gln Ser Thr Gly Glu Ala Arg 580 585 590 acc gaa gag gct gta aat aaa tat ggt ctt gag ctt gat gat ttt gtc 1824 Thr Glu Glu Ala Val Asn Lys Tyr Gly Leu Glu Leu Asp Asp Phe Val 595 600 605 tcg gga cct cga gag ctc ttg ttg aaa ttt gtg gaa gag aat tat ccc 1872 Ser Gly Pro Arg Glu Leu Leu Leu Lys Phe Val Glu Glu Asn Tyr Pro 610 615 620 tta ccc gag cag cct gaa cct ctg tca gag gat gat agt gtt aaa gaa 1920 Leu Pro Glu Gln Pro Glu Pro Leu Ser Glu Asp Asp Ser Val Lys Glu 625 630 635 640 ctt cag agg aag agg cat tca gct tca cct ggt gtt tca att aga ggg 1968 Leu Gln Arg Lys Arg His Ser Ala Ser Pro Gly Val Ser Ile Arg Gly 645 650 655 aga gta aga aag atg gca aag tgg aaa cca gat agt gat aat gaa agt 2016 Arg Val Arg Lys Met Ala Lys Trp Lys Pro Asp Ser Asp Asn Glu Ser 660 665 670 gat ttg gag tct gaa gct gac tct gct gac gat tct aat gat tct gat 2064 Asp Leu Glu Ser Glu Ala Asp Ser Ala Asp Asp Ser Asn Asp Ser Asp 675 680 685 gat gag ttc cag ata tgt cag ata tgc agt ggg gaa gat gaa agg aag 2112 Asp Glu Phe Gln Ile Cys Gln Ile Cys Ser Gly Glu Asp Glu Arg Lys 690 695 700 aag ttg ctc cat tgt tct gaa tgt gac aag ctt ttt cat ccg gac tgt 2160 Lys Leu Leu His Cys Ser Glu Cys Asp Lys Leu Phe His Pro Asp Cys 705 710 715 720 gtg gtt cct cca gtt ata gac tta cca tct gaa gcg tgg att tgc ttt 2208 Val Val Pro Pro Val Ile Asp Leu Pro Ser Glu Ala Trp Ile Cys Phe 725 730 735 tct tgt aag gaa aag aca gag gag tac att caa gca aga cgt ctt tac 2256 Ser Cys Lys Glu Lys Thr Glu Glu Tyr Ile Gln Ala Arg Arg Leu Tyr 740 745 750 att gcc gaa ttg cag aaa agg tat gaa gca gcg ctt gaa cga aaa tca 2304 Ile Ala Glu Leu Gln Lys Arg Tyr Glu Ala Ala Leu Glu Arg Lys Ser 755 760 765 aag atc ata gag atc atc cgt tct ctg aat ctt cca aac aat cct ttg 2352 Lys Ile Ile Glu Ile Ile Arg Ser Leu Asn Leu Pro Asn Asn Pro Leu 770 775 780 gat gac atc gtc gat cag ctt gga ggc cct gaa aaa gtt gca gaa atg 2400 Asp Asp Ile Val Asp Gln Leu Gly Gly Pro Glu Lys Val Ala Glu Met 785 790 795 
800 acg ggc agg cga ggt atg ctt gta aga gca tcg aat ggg aaa ggt gtt 2448 Thr Gly Arg Arg Gly Met Leu Val Arg Ala Ser Asn Gly Lys Gly Val 805 810 815 aca tat caa gca aga aac aca aag gat att acc atg gaa atg gtc aac 2496 Thr Tyr Gln Ala Arg Asn Thr Lys Asp Ile Thr Met Glu Met Val Asn 820 825 830 atg cac gag aag caa cta ttt atg gat ggt aaa aag ttg gtt gcc att 2544 Met His Glu Lys Gln Leu Phe Met Asp Gly Lys Lys Leu Val Ala Ile 835 840 845 att tct gaa gct ggg tca gct ggt gtt tcg ttg caa gca gat aga agg 2592 Ile Ser Glu Ala Gly Ser Ala Gly Val Ser Leu Gln Ala Asp Arg Arg 850 855 860 gca gta aat cag aaa aga agg gtt cac ttg act tta gag ctt ccc tgg 2640 Ala Val Asn Gln Lys Arg Arg Val His Leu Thr Leu Glu Leu Pro Trp 865 870 875 880 agt gct gat cgt gca att cag caa ttt gga aga act cac cgg tca aat 2688 Ser Ala Asp Arg Ala Ile Gln Gln Phe Gly Arg Thr His Arg Ser Asn 885 890 895 cag aca tct gca cct gaa tat aga cta ctc ttc act aat ctc ggt ggg 2736 Gln Thr Ser Ala Pro Glu Tyr Arg Leu Leu Phe Thr Asn Leu Gly Gly 900 905 910 gaa cgc cgc ttt gcc tct att gtt gca aag aga cta gaa aca ctt ggt 2784 Glu Arg Arg Phe Ala Ser Ile Val Ala Lys Arg Leu Glu Thr Leu Gly 915 920 925 gct tta aca cag gga gac cgc agg gct ggg ccc tcc ggg cct tct ttg 2832 Ala Leu Thr Gln Gly Asp Arg Arg Ala Gly Pro Ser Gly Pro Ser Leu 930 935 940 agt gcg tat aat tat gat agt aac ttt ggt aaa aaa tcc ttg atg gtg 2880 Ser Ala Tyr Asn Tyr Asp Ser Asn Phe Gly Lys Lys Ser Leu Met Val 945 950 955 960 atg tat cga ggc ata atg gag cag gag aaa cta cct gtt ctt cct cct 2928 Met Tyr Arg Gly Ile Met Glu Gln Glu Lys Leu Pro Val Leu Pro Pro 965 970 975 gga tgc tcg att gat gaa ccc gaa aca gtt aaa gag ttt ttg acc aag 2976 Gly Cys Ser Ile Asp Glu Pro Glu Thr Val Lys Glu Phe Leu Thr Lys 980 985 990 gca agg gct gct ctt gtt gcg gtt gga att gtt agg gac agt gtt ctg 3024 Ala Arg Ala Ala Leu Val Ala Val Gly Ile Val Arg Asp Ser Val Leu 995 1000 1005 gcc aat ggg aaa gat gtt gga agg ttt tca gga cgt att att gat 3069 Ala Asn Gly Lys Asp Val Gly Arg Phe 
Ser Gly Arg Ile Ile Asp 1010 1015 1020 tct gac atg cat gat gtt gga cga ttt ttg aat cgc ctc tta gga 3114 Ser Asp Met His Asp Val Gly Arg Phe Leu Asn Arg Leu Leu Gly 1025 1030 1035 ttg cca cct gat atc cag aat agg ctg ttt gaa ttg ttt act agt 3159 Leu Pro Pro Asp Ile Gln Asn Arg Leu Phe Glu Leu Phe Thr Ser 1040 1045 1050 ata tta gac gtt ctt gta cat aat gct cgg ata gaa gga agt ttt 3204 Ile Leu Asp Val Leu Val His Asn Ala Arg Ile Glu Gly Ser Phe 1055 1060 1065 gac tca gga att gtg gat atg aaa gct aat agt gtt gag cta ctg 3249 Asp Ser Gly Ile Val Asp Met Lys Ala Asn Ser Val Glu Leu Leu 1070 1075 1080 agc act cca aag acc gtc cat gtt gat caa atg tct ggt gcc tca 3294 Ser Thr Pro Lys Thr Val His Val Asp Gln Met Ser Gly Ala Ser 1085 1090 1095 act atg ctc ttt act ttt act cta gat cgt ggt gta aca tgg gag 3339 Thr Met Leu Phe Thr Phe Thr Leu Asp Arg Gly Val Thr Trp Glu 1100 1105 1110 tct gcg agt tca atg cta gag gga aag aga agg gac ggg ctt ggt 3384 Ser Ala Ser Ser Met Leu Glu Gly Lys Arg Arg Asp Gly Leu Gly 1115 1120 1125 tca gct aat gat ggc ttt ttt gaa tct aaa agg gaa tgg tta gga 3429 Ser Ala Asn Asp Gly Phe Phe Glu Ser Lys Arg Glu Trp Leu Gly 1130 1135 1140 aga cgg cac ttc ata tta gca ttt gag agt gcc gct tct ggg ttg 3474 Arg Arg His Phe Ile Leu Ala Phe Glu Ser Ala Ala Ser Gly Leu 1145 1150 1155 ttt aag att gta cgt cct gct gta gga gag tcg atc agg gag atg 3519 Phe Lys Ile Val Arg Pro Ala Val Gly Glu Ser Ile Arg Glu Met 1160 1165 1170 tct ctt tca gag ttg aaa act aaa tac cga aaa cta tct tct ctg 3564 Ser Leu Ser Glu Leu Lys Thr Lys Tyr Arg Lys Leu Ser Ser Leu 1175 1180 1185 gag aag gct cgt act ggt tgg gag gac gag tac gaa gtt tca tca 3609 Glu Lys Ala Arg Thr Gly Trp Glu Asp Glu Tyr Glu Val Ser Ser 1190 1195 1200 aaa cag tgt atg cat gga ccc aag tgt aag ctt ggg gag tat tgc 3654 Lys Gln Cys Met His Gly Pro Lys Cys Lys Leu Gly Glu Tyr Cys 1205 1210 1215 acg gtt gga cgt agg ata cag gaa gta aac gtt gtg ggt ggc ctt 3699 Thr Val Gly Arg Arg Ile Gln Glu Val Asn Val Val Gly Gly Leu 1220 1225 1230 
att ctt ccc atc tgg gga acc atc gag aaa gct ctc tcc aaa caa 3744 Ile Leu Pro Ile Trp Gly Thr Ile Glu Lys Ala Leu Ser Lys Gln 1235 1240 1245 gcc cgt cac agc cac aag aga att cga gtt ata cgc ata gag act 3789 Ala Arg His Ser His Lys Arg Ile Arg Val Ile Arg Ile Glu Thr 1250 1255 1260 acg acg gat aat cag cgg ata gtg ggc ctc tcc atc cca aat gca 3834 Thr Thr Asp Asn Gln Arg Ile Val Gly Leu Ser Ile Pro Asn Ala 1265 1270 1275 gct gta gaa act gtt tta caa gat tta gca tgg gtt caa gag att 3879 Ala Val Glu Thr Val Leu Gln Asp Leu Ala Trp Val Gln Glu Ile 1280 1285 1290 gat gat taa 3888 Asp Asp 1295 12 1295 PRT Arabidopsis thaliana 12 Met Thr Gln Ser Pro Val Gln Pro Pro Pro Pro Leu Pro Ala Gln Pro 1 5 10 15 His Ser Ala Ala Gly Gly Val Ile Arg Gly Asp Val Gln Val Arg Cys 20 25 30 Ala Gly Cys Arg Val Ile Leu Arg Val Lys Thr Gly Val Val Glu Phe 35 40 45 Ser Cys Pro Thr Cys Gln Leu Pro Gln Met Leu Pro Pro Glu Leu Leu 50 55 60 Ser Arg Ala Arg Pro Gln Phe Pro Gln Ser Pro Gln Gln Pro Pro Gln 65 70 75 80 Pro Ile Gln Thr Leu Pro Pro Pro Ile Gln Gln Gln Leu Lys Pro Leu 85 90 95 Asn Leu Pro Arg Pro Pro Val Pro Ala His Gly Ile Asp Pro Thr Lys 100 105 110 Met Gln Leu Pro Cys Ala Asn Cys Gln Ala Ile Leu Asn Val Pro His 115 120 125 Gly Leu Thr Arg Phe Ser Cys Pro Gln Cys His Val Glu Leu Ala Val 130 135 140 Asp Val Ser Lys Leu Asn Arg Ser Leu Thr Ala Ser Gln Ser His Ser 145 150 155 160 Asn Pro Pro Thr Pro Ala Ala Pro Thr Val Pro Pro Pro Pro Pro Pro 165 170 175 Glu Glu Val Asn Glu Glu Ala Ile Glu Val Glu Arg Glu Glu Asp Glu 180 185 190 Gly Gly Thr Ala Gly Glu Thr Phe Met Asp Tyr Arg Pro Pro Lys Leu 195 200 205 Ser Ile Gly Pro Pro His Pro Asp Pro Ile Val Glu Thr Ser Ser Leu 210 215 220 Ser Ala Val Gln Pro Pro Glu Pro Thr Tyr Asp Leu Lys Ile Lys Glu 225 230 235 240 Glu Leu Glu Arg Ser Lys Ala Leu Ser Cys Leu Gln Ile Glu Thr Leu 245 250 255 Val Tyr Ala Cys Gln Arg His Leu Gln His Leu Ala Asp Gly Thr Arg 260 265 270 Ala Gly Phe Phe Val Gly Asp Gly Ala Gly Val Gly Lys Gly Arg Thr 275 280 285 Ile Ala Gly 
Leu Ile Trp Glu Asn Trp Lys His Gly Arg Arg Lys Ala 290 295 300 Leu Trp Ile Ser Ile Gly Ser Asp Leu Lys Tyr Asp Ala Arg Arg Asp 305 310 315 320 Leu Asp Asp Val Gly Ala Thr Cys Val Gly Val Asn Pro Leu Asn Lys 325 330 335 Leu Pro Tyr Ser Lys Leu Asp Ser Lys Asn Val Gly Ile Lys Glu Gly 340 345 350 Val Val Phe Leu Thr Tyr Asn Ser Leu Ile Ala Ser Ser Glu Lys Gly 355 360 365 Arg Ser Arg Leu Gln Gln Leu Val Gln Trp Cys Gly Pro Glu Phe Asp 370 375 380 Gly Leu Leu Ile Phe Asp Glu Cys His Lys Ala Lys Asn Leu Val Pro 385 390 395 400 Glu Ala Gly Ser Gln Pro Thr Arg Ile Gly Gln Ala Val Val Asp Ile 405 410 415 Gln Asp Lys Ile Pro Gln Ala Arg Val Ile Tyr Cys Ser Ala Thr Gly 420 425 430 Ala Ser Glu Pro Arg Asn Met Gly Tyr Met Val Arg Leu Gly Leu Trp 435 440 445 Gly Ala Gly Thr Ser Phe Ser Asp Phe Asn Lys Phe Leu Gly Ala Leu 450 455 460 Asp Lys Gly Gly Thr Gly Ala Leu Glu Leu Val Ala Met Asp Met Lys 465 470 475 480 Ala Arg Gly Met Tyr Val Cys Arg Thr Leu Ser Tyr Lys Gly Ala Glu 485 490 495 Phe Glu Ile Val Glu Ala Arg Leu Glu Ala Gly Met Glu Ala Met Tyr 500 505 510 Asn Lys Ser Ala Glu Phe Trp Ala Glu Leu Arg Ile Glu Leu Leu Ser 515 520 525 Ala Ser Ala Phe Leu Pro Asn Glu Lys Pro Asn Ser Ser Gln Leu Trp 530 535 540 Arg Leu Tyr Trp Ser Ser His Gln Arg Phe Phe Arg His Leu Cys Met 545 550 555 560 Ser Ala Lys Val Pro Val Thr Val Arg Leu Ala Lys Lys Ala Leu Ser 565 570 575 Thr Asn Lys Cys Val Val Ile Gly Leu Gln Ser Thr Gly Glu Ala Arg 580 585 590 Thr Glu Glu Ala Val Asn Lys Tyr Gly Leu Glu Leu Asp Asp Phe Val 595 600 605 Ser Gly Pro Arg Glu Leu Leu Leu Lys Phe Val Glu Glu Asn Tyr Pro 610 615 620 Leu Pro Glu Gln Pro Glu Pro Leu Ser Glu Asp Asp Ser Val Lys Glu 625 630 635 640 Leu Gln Arg Lys Arg His Ser Ala Ser Pro Gly Val Ser Ile Arg Gly 645 650 655 Arg Val Arg Lys Met Ala Lys Trp Lys Pro Asp Ser Asp Asn Glu Ser 660 665 670 Asp Leu Glu Ser Glu Ala Asp Ser Ala Asp Asp Ser Asn Asp Ser Asp 675 680 685 Asp Glu Phe Gln Ile Cys Gln Ile Cys Ser Gly Glu Asp Glu Arg Lys 690 695 700 Lys Leu Leu His 
Cys Ser Glu Cys Asp Lys Leu Phe His Pro Asp Cys 705 710 715 720 Val Val Pro Pro Val Ile Asp Leu Pro Ser Glu Ala Trp Ile Cys Phe 725 730 735 Ser Cys Lys Glu Lys Thr Glu Glu Tyr Ile Gln Ala Arg Arg Leu Tyr 740 745 750 Ile Ala Glu Leu Gln Lys Arg Tyr Glu Ala Ala Leu Glu Arg Lys Ser 755 760 765 Lys Ile Ile Glu Ile Ile Arg Ser Leu Asn Leu Pro Asn Asn Pro Leu 770 775 780 Asp Asp Ile Val Asp Gln Leu Gly Gly Pro Glu Lys Val Ala Glu Met 785 790 795 800 Thr Gly Arg Arg Gly Met Leu Val Arg Ala Ser Asn Gly Lys Gly Val 805 810 815 Thr Tyr Gln Ala Arg Asn Thr Lys Asp Ile Thr Met Glu Met Val Asn 820 825 830 Met His Glu Lys Gln Leu Phe Met Asp Gly Lys Lys Leu Val Ala Ile 835 840 845 Ile Ser Glu Ala Gly Ser Ala Gly Val Ser Leu Gln Ala Asp Arg Arg 850 855 860 Ala Val Asn Gln Lys Arg Arg Val His Leu Thr Leu Glu Leu Pro Trp 865 870 875 880 Ser Ala Asp Arg Ala Ile Gln Gln Phe Gly Arg Thr His Arg Ser Asn 885 890 895 Gln Thr Ser Ala Pro Glu Tyr Arg Leu Leu Phe Thr Asn Leu Gly Gly 900 905 910 Glu Arg Arg Phe Ala Ser Ile Val Ala Lys Arg Leu Glu Thr Leu Gly 915 920 925 Ala Leu Thr Gln Gly Asp Arg Arg Ala Gly Pro Ser Gly Pro Ser Leu 930 935 940 Ser Ala Tyr Asn Tyr Asp Ser Asn Phe Gly Lys Lys Ser Leu Met Val 945 950 955 960 Met Tyr Arg Gly Ile Met Glu Gln Glu Lys Leu Pro Val Leu Pro Pro 965 970 975 Gly Cys Ser Ile Asp Glu Pro Glu Thr Val Lys Glu Phe Leu Thr Lys 980 985 990 Ala Arg Ala Ala Leu Val Ala Val Gly Ile Val Arg Asp Ser Val Leu 995 1000 1005 Ala Asn Gly Lys Asp Val Gly Arg Phe Ser Gly Arg Ile Ile Asp 1010 1015 1020 Ser Asp Met His Asp Val Gly Arg Phe Leu Asn Arg Leu Leu Gly 1025 1030 1035 Leu Pro Pro Asp Ile Gln Asn Arg Leu Phe Glu Leu Phe Thr Ser 1040 1045 1050 Ile Leu Asp Val Leu Val His Asn Ala Arg Ile Glu Gly Ser Phe 1055 1060 1065 Asp Ser Gly Ile Val Asp Met Lys Ala Asn Ser Val Glu Leu Leu 1070 1075 1080 Ser Thr Pro Lys Thr Val His Val Asp Gln Met Ser Gly Ala Ser 1085 1090 1095 Thr Met Leu Phe Thr Phe Thr Leu Asp Arg Gly Val Thr Trp Glu 1100 1105 1110 Ser Ala Ser Ser Met Leu Glu 
Gly Lys Arg Arg Asp Gly Leu Gly 1115 1120 1125 Ser Ala Asn Asp Gly Phe Phe Glu Ser Lys Arg Glu Trp Leu Gly 1130 1135 1140 Arg Arg His Phe Ile Leu Ala Phe Glu Ser Ala Ala Ser Gly Leu 1145 1150 1155 Phe Lys Ile Val Arg Pro Ala Val Gly Glu Ser Ile Arg Glu Met 1160 1165 1170 Ser Leu Ser Glu Leu Lys Thr Lys Tyr Arg Lys Leu Ser Ser Leu 1175 1180 1185 Glu Lys Ala Arg Thr Gly Trp Glu Asp Glu Tyr Glu Val Ser Ser 1190 1195 1200 Lys Gln Cys Met His Gly Pro Lys Cys Lys Leu Gly Glu Tyr Cys 1205 1210 1215 Thr Val Gly Arg Arg Ile Gln Glu Val Asn Val Val Gly Gly Leu 1220 1225 1230 Ile Leu Pro Ile Trp Gly Thr Ile Glu Lys Ala Leu Ser Lys Gln 1235 1240 1245 Ala Arg His Ser His Lys Arg Ile Arg Val Ile Arg Ile Glu Thr 1250 1255 1260 Thr Thr Asp Asn Gln Arg Ile Val Gly Leu Ser Ile Pro Asn Ala 1265 1270 1275 Ala Val Glu Thr Val Leu Gln Asp Leu Ala Trp Val Gln Glu Ile 1280 1285 1290 Asp Asp 1295 13 1038 DNA Arabidopsis thaliana CDS (1)..(1038) 13 atg gtg ttg gta acc cat aaa ttg cag ggt tca cat gtc ttg tct cct 48 Met Val Leu Val Thr His Lys Leu Gln Gly Ser His Val Leu Ser Pro 1 5 10 15 tgg agt tcg cca aca tgg acc aga ggc ttg tgt tta aag aga cct gtt 96 Trp Ser Ser Pro Thr Trp Thr Arg Gly Leu Cys Leu Lys Arg Pro Val 20 25 30 acg aca gtg cac ttt gtt gag agg aaa gat aag cat tta cga ttg aag 144 Thr Thr Val His Phe Val Glu Arg Lys Asp Lys His Leu Arg Leu Lys 35 40 45 caa aaa tgt tgt ttt agt tta ggg tcc cct tgc aac ggt gga ttg aga 192 Gln Lys Cys Cys Phe Ser Leu Gly Ser Pro Cys Asn Gly Gly Leu Arg 50 55 60 gca aaa cct ttc aga gtt aca tct ttt aaa ggt ggc atc caa aat aat 240 Ala Lys Pro Phe Arg Val Thr Ser Phe Lys Gly Gly Ile Gln Asn Asn 65 70 75 80 gaa tca gga gga agt gag ggt ggg aag aaa gtt act aat aac tct gtc 288 Glu Ser Gly Gly Ser Glu Gly Gly Lys Lys Val Thr Asn Asn Ser Val 85 90 95 aaa cta tct tac cgt tca gat gat gac gaa aat aat gtg aat ggc tcc 336 Lys Leu Ser Tyr Arg Ser Asp Asp Asp Glu Asn Asn Val Asn Gly Ser 100 105 110 cca aag gca cag aac act tcg ttt tca tat acc tcg gaa aca gag gat 384 
Pro Lys Ala Gln Asn Thr Ser Phe Ser Tyr Thr Ser Glu Thr Glu Asp 115 120 125 tcc atg aca ggg caa ccc gct att cag aaa cta ttc aag aaa tgg ttg 432 Ser Met Thr Gly Gln Pro Ala Ile Gln Lys Leu Phe Lys Lys Trp Leu 130 135 140 aca ttg ttg cgc aca cag tca cct att caa gtg att gac gag act tta 480 Thr Leu Leu Arg Thr Gln Ser Pro Ile Gln Val Ile Asp Glu Thr Leu 145 150 155 160 gga gga gaa cag gtt cca caa acg acg aag caa gaa act gaa aca gaa 528 Gly Gly Glu Gln Val Pro Gln Thr Thr Lys Gln Glu Thr Glu Thr Glu 165 170 175 att cgg aaa gca gaa agt ctc cag agt acc aaa aat aca gtt tgg tcc 576 Ile Arg Lys Ala Glu Ser Leu Gln Ser Thr Lys Asn Thr Val Trp Ser 180 185 190 tgg ttt tgg agt cta gac gca gct atc aag att ccg ttg ctg tta ttt 624 Trp Phe Trp Ser Leu Asp Ala Ala Ile Lys Ile Pro Leu Leu Leu Phe 195 200 205 gtt cca gct ttc cta gct gtt tac gcg atc tgt ggt gct gaa gtt aca 672 Val Pro Ala Phe Leu Ala Val Tyr Ala Ile Cys Gly Ala Glu Val Thr 210 215 220 aag gaa tta tcc cct atg tgg gtg gtt ggc ccc ttg atc atc gcc ctt 720 Lys Glu Leu Ser Pro Met Trp Val Val Gly Pro Leu Ile Ile Ala Leu 225 230 235 240 tat ata aaa atg ttc caa gga cta tgt tcc ctt tac gcc ttc tgc ttc 768 Tyr Ile Lys Met Phe Gln Gly Leu Cys Ser Leu Tyr Ala Phe Cys Phe 245 250 255 aac caa acc atc aaa atg ata aga aac cta cca tcg tat tat ctt gta 816 Asn Gln Thr Ile Lys Met Ile Arg Asn Leu Pro Ser Tyr Tyr Leu Val 260 265 270 gca tac caa tac att gcc gat gga aag ctc aaa gac gac gtt aaa gct 864 Ala Tyr Gln Tyr Ile Ala Asp Gly Lys Leu Lys Asp Asp Val Lys Ala 275 280 285 cta gtg ttt aaa cca gtg gtg gcc atc aag aac aca gac tac aaa gag 912 Leu Val Phe Lys Pro Val Val Ala Ile Lys Asn Thr Asp Tyr Lys Glu 290 295 300 ctc aca cgc acg aag ctg aaa cag ttc caa gaa tgg atc att gag aaa 960 Leu Thr Arg Thr Lys Leu Lys Gln Phe Gln Glu Trp Ile Ile Glu Lys 305 310 315 320 tac ctg gat ttt gtt gaa tct gta tgg cct tat tac tgc aga act atc 1008 Tyr Leu Asp Phe Val Glu Ser Val Trp Pro Tyr Tyr Cys Arg Thr Ile 325 330 335 aga ttc ctg aag agg gct aac ctg att 
tga 1038 Arg Phe Leu Lys Arg Ala Asn Leu Ile 340 345 14 345 PRT Arabidopsis thaliana 14 Met Val Leu Val Thr His Lys Leu Gln Gly Ser His Val Leu Ser Pro 1 5 10 15 Trp Ser Ser Pro Thr Trp Thr Arg Gly Leu Cys Leu Lys Arg Pro Val 20 25 30 Thr Thr Val His Phe Val Glu Arg Lys Asp Lys His Leu Arg Leu Lys 35 40 45 Gln Lys Cys Cys Phe Ser Leu Gly Ser Pro Cys Asn Gly Gly Leu Arg 50 55 60 Ala Lys Pro Phe Arg Val Thr Ser Phe Lys Gly Gly Ile Gln Asn Asn 65 70 75 80 Glu Ser Gly Gly Ser Glu Gly Gly Lys Lys Val Thr Asn Asn Ser Val 85 90 95 Lys Leu Ser Tyr Arg Ser Asp Asp Asp Glu Asn Asn Val Asn Gly Ser 100 105 110 Pro Lys Ala Gln Asn Thr Ser Phe Ser Tyr Thr Ser Glu Thr Glu Asp 115 120 125 Ser Met Thr Gly Gln Pro Ala Ile Gln Lys Leu Phe Lys Lys Trp Leu 130 135 140 Thr Leu Leu Arg Thr Gln Ser Pro Ile Gln Val Ile Asp Glu Thr Leu 145 150 155 160 Gly Gly Glu Gln Val Pro Gln Thr Thr Lys Gln Glu Thr Glu Thr Glu 165 170 175 Ile Arg Lys Ala Glu Ser Leu Gln Ser Thr Lys Asn Thr Val Trp Ser 180 185 190 Trp Phe Trp Ser Leu Asp Ala Ala Ile Lys Ile Pro Leu Leu Leu Phe 195 200 205 Val Pro Ala Phe Leu Ala Val Tyr Ala Ile Cys Gly Ala Glu Val Thr 210 215 220 Lys Glu Leu Ser Pro Met Trp Val Val Gly Pro Leu Ile Ile Ala Leu 225 230 235 240 Tyr Ile Lys Met Phe Gln Gly Leu Cys Ser Leu Tyr Ala Phe Cys Phe 245 250 255 Asn Gln Thr Ile Lys Met Ile Arg Asn Leu Pro Ser Tyr Tyr Leu Val 260 265 270 Ala Tyr Gln Tyr Ile Ala Asp Gly Lys Leu Lys Asp Asp Val Lys Ala 275 280 285 Leu Val Phe Lys Pro Val Val Ala Ile Lys Asn Thr Asp Tyr Lys Glu 290 295 300 Leu Thr Arg Thr Lys Leu Lys Gln Phe Gln Glu Trp Ile Ile Glu Lys 305 310 315 320 Tyr Leu Asp Phe Val Glu Ser Val Trp Pro Tyr Tyr Cys Arg Thr Ile 325 330 335 Arg Phe Leu Lys Arg Ala Asn Leu Ile 340 345 15 6474 DNA Arabidopsis thaliana CDS (1)..(6474) 15 atg gaa ggg gat gag cga gga gtc tta ctt gct tgt gta att tcg ggt 48 Met Glu Gly Asp Glu Arg Gly Val Leu Leu Ala Cys Val Ile Ser Gly 1 5 10 15 acc ctt ttc acg gtt ttc ggt tcg ggt tcg ttt tgg ata ctt tgg gct 96 Thr Leu Phe 
Thr Val Phe Gly Ser Gly Ser Phe Trp Ile Leu Trp Ala 20 25 30 gtt aat tgg cgg cca tgg cgt ctc tac agt tgg atc ttt gct aga aaa 144 Val Asn Trp Arg Pro Trp Arg Leu Tyr Ser Trp Ile Phe Ala Arg Lys 35 40 45 tgg cca aaa gta ttg caa ggt cct cag ctt gat ata cta tgt ggt gtt 192 Trp Pro Lys Val Leu Gln Gly Pro Gln Leu Asp Ile Leu Cys Gly Val 50 55 60 cta tct ctt ttt gct tgg att gtg gta gta tcc cct att gca atc ttg 240 Leu Ser Leu Phe Ala Trp Ile Val Val Val Ser Pro Ile Ala Ile Leu 65 70 75 80 ata gga tgg ggt tct tgg ctg att gtg ata ttg gat cga cat atc att 288 Ile Gly Trp Gly Ser Trp Leu Ile Val Ile Leu Asp Arg His Ile Ile 85 90 95 ggg ctg gcg ata ata atg gct gga rca gcc ctt tta ctg gca ttc tac 336 Gly Leu Ala Ile Ile Met Ala Gly Xaa Ala Leu Leu Leu Ala Phe Tyr 100 105 110 tca atc atg ctt tgg tgg agg acc cag tgg caa agc tca aga gct gtc 384 Ser Ile Met Leu Trp Trp Arg Thr Gln Trp Gln Ser Ser Arg Ala Val 115 120 125 gct tta ctt ctc ctt ctt ggt gtt gcc tta cta tgt gcg tat gaa ctc 432 Ala Leu Leu Leu Leu Leu Gly Val Ala Leu Leu Cys Ala Tyr Glu Leu 130 135 140 tgt gct gtc tat gtt acg gct ggt gcg cat gca tct cag caa tat tct 480 Cys Ala Val Tyr Val Thr Ala Gly Ala His Ala Ser Gln Gln Tyr Ser 145 150 155 160 cct tct ggt ttc ttt ttc ggt gta tca gca atc gcg ttg gca att aac 528 Pro Ser Gly Phe Phe Phe Gly Val Ser Ala Ile Ala Leu Ala Ile Asn 165 170 175 atg cta ttt atc tgc cgc atg gtc ttt aat gga aat ggt tta gat gtg 576 Met Leu Phe Ile Cys Arg Met Val Phe Asn Gly Asn Gly Leu Asp Val 180 185 190 gac gaa tat gta agg agg gca tat aaa ttt gct tat tca gat tgt ata 624 Asp Glu Tyr Val Arg Arg Ala Tyr Lys Phe Ala Tyr Ser Asp Cys Ile 195 200 205 gaa gtg ggt cct gtg gct tgt ttg cct gaa cct cct gat cct aat gaa 672 Glu Val Gly Pro Val Ala Cys Leu Pro Glu Pro Pro Asp Pro Asn Glu 210 215 220 tta tat ccc cgg caa acc agc agg gct tca cat ctt ggc ctt ctg tac 720 Leu Tyr Pro Arg Gln Thr Ser Arg Ala Ser His Leu Gly Leu Leu Tyr 225 230 235 240 ctg ggc tca ctc gta gtt ctc ctt gcc tac tca gtc cta tat ggt ctc 768 
Leu Gly Ser Leu Val Val Leu Leu Ala Tyr Ser Val Leu Tyr Gly Leu 245 250 255 aca gct agg gaa tca cgt tgg ctt gga gga atc aca tca gct gca gtt 816 Thr Ala Arg Glu Ser Arg Trp Leu Gly Gly Ile Thr Ser Ala Ala Val 260 265 270 att gtt ctt gac tgg aat att ggg gca tgc ttg tat ggg ttt aag ctt 864 Ile Val Leu Asp Trp Asn Ile Gly Ala Cys Leu Tyr Gly Phe Lys Leu 275 280 285 ctt cag aat cgt gtt ctg gca ctt ttt gtt gct ggc ata tcc cgt ctt 912 Leu Gln Asn Arg Val Leu Ala Leu Phe Val Ala Gly Ile Ser Arg Leu 290 295 300 ttc cta ata tgt ttt ggc ata cac tac tgg tac cta ggg cat tgt att 960 Phe Leu Ile Cys Phe Gly Ile His Tyr Trp Tyr Leu Gly His Cys Ile 305 310 315 320 agt tac att ttc gta gca tca gtt cta tca ggt gct gct gtt tct cgg 1008 Ser Tyr Ile Phe Val Ala Ser Val Leu Ser Gly Ala Ala Val Ser Arg 325 330 335 cat cta tct ata aca gac cca tca gct gca aga aga gat gcc tta cag 1056 His Leu Ser Ile Thr Asp Pro Ser Ala Ala Arg Arg Asp Ala Leu Gln 340 345 350 agc aca gtg atc cgc ttg aga gaa ggt ttt cgg aga aaa gag cag aat 1104 Ser Thr Val Ile Arg Leu Arg Glu Gly Phe Arg Arg Lys Glu Gln Asn 355 360 365 agt tct tca agt tct tca gat ggt tgt ggc tca agt ata aaa aga agt 1152 Ser Ser Ser Ser Ser Ser Asp Gly Cys Gly Ser Ser Ile Lys Arg Ser 370 375 380 agt agt atc gat gct ggc cat act ggt tgt act aat gaa gca aat cgt 1200 Ser Ser Ile Asp Ala Gly His Thr Gly Cys Thr Asn Glu Ala Asn Arg 385 390 395 400 acg gca gaa tct tgc acg gct gac aat cta act cga aca ggc agc tct 1248 Thr Ala Glu Ser Cys Thr Ala Asp Asn Leu Thr Arg Thr Gly Ser Ser 405 410 415 cag gag gga atc aat agc gac aaa agc gaa gaa agt gga aga cca agc 1296 Gln Glu Gly Ile Asn Ser Asp Lys Ser Glu Glu Ser Gly Arg Pro Ser 420 425 430 tta ggt tta cgt agt agt tca tgt cgt tct gtg gtc caa gag ccc gaa 1344 Leu Gly Leu Arg Ser Ser Ser Cys Arg Ser Val Val Gln Glu Pro Glu 435 440 445 gca gga acg tct tat ttt atg gac aaa gtt tct gat caa aat aac act 1392 Ala Gly Thr Ser Tyr Phe Met Asp Lys Val Ser Asp Gln Asn Asn Thr 450 455 460 ctt gtt gtt tgt tcg agc agt ggt 
cta gat agc caa ggt tac gag tct 1440 Leu Val Val Cys Ser Ser Ser Gly Leu Asp Ser Gln Gly Tyr Glu Ser 465 470 475 480 agc aca tcg aat tct gca aac cag cag ctt ttg gat atg aat ttg gct 1488 Ser Thr Ser Asn Ser Ala Asn Gln Gln Leu Leu Asp Met Asn Leu Ala 485 490 495 ctt gct ttc cag gac cag tta aac aat cct agg ata gcc tcg ata ctt 1536 Leu Ala Phe Gln Asp Gln Leu Asn Asn Pro Arg Ile Ala Ser Ile Leu 500 505 510 aag aag aaa sma aaa gaa ggt gat ctt gaa ctg act aat ttg ctg caa 1584 Lys Lys Lys Xaa Lys Glu Gly Asp Leu Glu Leu Thr Asn Leu Leu Gln 515 520 525 gac aag ggg ttg gac cct aac ttt gct gta atg ttg aag gaa aaa aac 1632 Asp Lys Gly Leu Asp Pro Asn Phe Ala Val Met Leu Lys Glu Lys Asn 530 535 540 ttg gat cct gga aaa aac ttg ggt ctn act ata ttg gca cta ctt cag 1680 Leu Asp Pro Gly Lys Asn Leu Gly Xaa Thr Ile Leu Ala Leu Leu Gln 545 550 555 560 agg agt agt ttg gat gca gat aga gat cac cgc gac aat act gat att 1728 Arg Ser Ser Leu Asp Ala Asp Arg Asp His Arg Asp Asn Thr Asp Ile 565 570 575 aca atc att gac tca aac agt gtt gac aat act ttg cca aat cag att 1776 Thr Ile Ile Asp Ser Asn Ser Val Asp Asn Thr Leu Pro Asn Gln Ile 580 585 590 tct tta tcc gaa gaa ttg aga ctc cgt gga cta gag aag tgg ctt aag 1824 Ser Leu Ser Glu Glu Leu Arg Leu Arg Gly Leu Glu Lys Trp Leu Lys 595 600 605 ttg tct aga ctt ctt ctg cac cat gta gcg ggg aca cca gag aga gca 1872 Leu Ser Arg Leu Leu Leu His His Val Ala Gly Thr Pro Glu Arg Ala 610 615 620 tgg ggc ctc ttt agt ctt gtc ttt atc ctt gaa aca atc att gtg gcc 1920 Trp Gly Leu Phe Ser Leu Val Phe Ile Leu Glu Thr Ile Ile Val Ala 625 630 635 640 att ttt cgc cca aag acc atc acg att ata aat tct agt cat caa cag 1968 Ile Phe Arg Pro Lys Thr Ile Thr Ile Ile Asn Ser Ser His Gln Gln 645 650 655 ttc gaa ttt ggt ttc tct gtg ctg cta ttg tca cct gtt gtc tgt tca 2016 Phe Glu Phe Gly Phe Ser Val Leu Leu Leu Ser Pro Val Val Cys Ser 660 665 670 ata atg gct ttt ctt cgg tct ctt caa gtt gag gaa atg gcc ttg aca 2064 Ile Met Ala Phe Leu Arg Ser Leu Gln Val Glu Glu Met Ala Leu Thr 
675 680 685 tca aaa tct cgc aag tat ggc ttt gtt gcc tgg ctt ctg agc aca tca 2112 Ser Lys Ser Arg Lys Tyr Gly Phe Val Ala Trp Leu Leu Ser Thr Ser 690 695 700 gtt gga ttg tca ctc tcg ttc ttg agt aaa tcg tca gta ctt ctg gga 2160 Val Gly Leu Ser Leu Ser Phe Leu Ser Lys Ser Ser Val Leu Leu Gly 705 710 715 720 ata tcc ttg act gtg ccc ctc atg gca gca tgc ctg tct att gct gtt 2208 Ile Ser Leu Thr Val Pro Leu Met Ala Ala Cys Leu Ser Ile Ala Val 725 730 735 ccc ata tgg atg cat aat ggg tac caa ttt tgg gtt cca cag tta tca 2256 Pro Ile Trp Met His Asn Gly Tyr Gln Phe Trp Val Pro Gln Leu Ser 740 745 750 tgt ggt gac cag gca aga gat tta cga tct ccc agg ata aag ggg ttt 2304 Cys Gly Asp Gln Ala Arg Asp Leu Arg Ser Pro Arg Ile Lys Gly Phe 755 760 765 att ctt tgg att tgt gtt gtg ttg ttt gcg ggt tct gta att tct ctt 2352 Ile Leu Trp Ile Cys Val Val Leu Phe Ala Gly Ser Val Ile Ser Leu 770 775 780 ggt gcg att ata tct gct aaa cct ttg gat gat tta aag tat aag ctg 2400 Gly Ala Ile Ile Ser Ala Lys Pro Leu Asp Asp Leu Lys Tyr Lys Leu 785 790 795 800 ttt agt gcc aga gaa aac aac gtc acg tca cca tat aca tct tct gta 2448 Phe Ser Ala Arg Glu Asn Asn Val Thr Ser Pro Tyr Thr Ser Ser Val 805 810 815 tac ctt ggt tgg gca atg tca tct gga att gct tta gta gtt acc gcc 2496 Tyr Leu Gly Trp Ala Met Ser Ser Gly Ile Ala Leu Val Val Thr Ala 820 825 830 att cta cca ata gtt tca tgg ttt gca act tat agg ttt tcc cac tct 2544 Ile Leu Pro Ile Val Ser Trp Phe Ala Thr Tyr Arg Phe Ser His Ser 835 840 845 tct gct gtc tgt ctc atg ata ttc tca gtt gtt ctc gtg gca ttt tgt 2592 Ser Ala Val Cys Leu Met Ile Phe Ser Val Val Leu Val Ala Phe Cys 850 855 860 gga act tca tat ttg gaa gtt gta aaa tct aga gat gat cag ttg ccc 2640 Gly Thr Ser Tyr Leu Glu Val Val Lys Ser Arg Asp Asp Gln Leu Pro 865 870 875 880 aca aag ggt gat ttc ctt gcg gcc ttg ctt cca ctt gca tgc att ccg 2688 Thr Lys Gly Asp Phe Leu Ala Ala Leu Leu Pro Leu Ala Cys Ile Pro 885 890 895 gcg ctg ctt tca cta tgc tgt ggg atg gtt aaa tgg aag gac gat tgt 2736 Ala Leu Leu Ser Leu 
Cys Cys Gly Met Val Lys Trp Lys Asp Asp Cys 900 905 910 tgg ata ctc tct cga ggt gta tat gtt ttc ttt tca ata ggt ctt ctt 2784 Trp Ile Leu Ser Arg Gly Val Tyr Val Phe Phe Ser Ile Gly Leu Leu 915 920 925 ctt ctt ttt ggt gcg ata gca gct gtc att gca gtc aaa cca tgg acg 2832 Leu Leu Phe Gly Ala Ile Ala Ala Val Ile Ala Val Lys Pro Trp Thr 930 935 940 ata ggc gta tct ttt ctc tta gtt ctt ttc ctt atg gtg gta aca att 2880 Ile Gly Val Ser Phe Leu Leu Val Leu Phe Leu Met Val Val Thr Ile 945 950 955 960 ggt gta atc cat ctt tgg gcg tca aac aat ttc tat tta acc agg aaa 2928 Gly Val Ile His Leu Trp Ala Ser Asn Asn Phe Tyr Leu Thr Arg Lys 965 970 975 cag aca tcc ttt gtc tgc ttt ctt gct ctt ctt ttg ggt ttg gcc gca 2976 Gln Thr Ser Phe Val Cys Phe Leu Ala Leu Leu Leu Gly Leu Ala Ala 980 985 990 ttc ctt ctc gga tgg cat caa gat aaa gca ttt gct gga gca tct gtt 3024 Phe Leu Leu Gly Trp His Gln Asp Lys Ala Phe Ala Gly Ala Ser Val 995 1000 1005 ggt tac ttt aca ttc ctg tct ctg ttg gct gga aga gca tta gct 3069 Gly Tyr Phe Thr Phe Leu Ser Leu Leu Ala Gly Arg Ala Leu Ala 1010 1015 1020 gtt ctt cta tcc cca cca att gta gta tat tct cca agg gtg cta 3114 Val Leu Leu Ser Pro Pro Ile Val Val Tyr Ser Pro Arg Val Leu 1025 1030 1035 cca gta tat gtc tac gat gct cat gct gat tgc gga aag aat gtc 3159 Pro Val Tyr Val Tyr Asp Ala His Ala Asp Cys Gly Lys Asn Val 1040 1045 1050 agt gct gca ttt ctt gtc ctg tat gga att gct ttg gca aca gaa 3204 Ser Ala Ala Phe Leu Val Leu Tyr Gly Ile Ala Leu Ala Thr Glu 1055 1060 1065 ggc tgg ggt gtt gtt gct agt ctg ata att tat cct ccg ttt gcg 3249 Gly Trp Gly Val Val Ala Ser Leu Ile Ile Tyr Pro Pro Phe Ala 1070 1075 1080 ggt gct gct gta tca gct atc acc ctt gta gta gcc ttt ggg ttt 3294 Gly Ala Ala Val Ser Ala Ile Thr Leu Val Val Ala Phe Gly Phe 1085 1090 1095 gct gtt tct cgc cca tgt ttg act ctt gag atg atg gag gtt gct 3339 Ala Val Ser Arg Pro Cys Leu Thr Leu Glu Met Met Glu Val Ala 1100 1105 1110 gta cgc ttt ctt agc aag gat act ata gtg caa gct atc tct cga 3384 Val Arg Phe Leu Ser 
Lys Asp Thr Ile Val Gln Ala Ile Ser Arg 1115 1120 1125 tct gcc acg aaa aca aga aat gct cta tcc ggc acg tat tca gct 3429 Ser Ala Thr Lys Thr Arg Asn Ala Leu Ser Gly Thr Tyr Ser Ala 1130 1135 1140 ccc caa agg tcc gcc agc tct gca gct ctt ctg gtt ggg gat ccc 3474 Pro Gln Arg Ser Ala Ser Ser Ala Ala Leu Leu Val Gly Asp Pro 1145 1150 1155 tct gca atg cgt gat aaa gca ggg aac ttt gtg ctt cct aga gat 3519 Ser Ala Met Arg Asp Lys Ala Gly Asn Phe Val Leu Pro Arg Asp 1160 1165 1170 gat gtc atg aaa tta agg gat cgt ctc agg aac gaa gaa aga gtt 3564 Asp Val Met Lys Leu Arg Asp Arg Leu Arg Asn Glu Glu Arg Val 1175 1180 1185 gct gga tca atc ttc tac aaa atg caa tgc agg aaa gga ttc cgt 3609 Ala Gly Ser Ile Phe Tyr Lys Met Gln Cys Arg Lys Gly Phe Arg 1190 1195 1200 cat gaa cca cct aca aat gta gat tat aga aga gac atg tgt gcc 3654 His Glu Pro Pro Thr Asn Val Asp Tyr Arg Arg Asp Met Cys Ala 1205 1210 1215 cat gca aga gtt ttg gca ctg gaa gag gca att gat aca gaa tgg 3699 His Ala Arg Val Leu Ala Leu Glu Glu Ala Ile Asp Thr Glu Trp 1220 1225 1230 gtg tat atg tgg gac aaa ttt ggt ggt tat tta cta cta ttg tta 3744 Val Tyr Met Trp Asp Lys Phe Gly Gly Tyr Leu Leu Leu Leu Leu 1235 1240 1245 ggt ttg aca gct aag gcg gag aga gtt cag gat gag gta cgg ttg 3789 Gly Leu Thr Ala Lys Ala Glu Arg Val Gln Asp Glu Val Arg Leu 1250 1255 1260 cgg ctc ttc tta gat agc att ggg ttc tcg gat tta agt gcc aga 3834 Arg Leu Phe Leu Asp Ser Ile Gly Phe Ser Asp Leu Ser Ala Arg 1265 1270 1275 aaa atc agt aaa tgg aag cca gag gat aga aga caa ttc gaa att 3879 Lys Ile Ser Lys Trp Lys Pro Glu Asp Arg Arg Gln Phe Glu Ile 1280 1285 1290 att caa gag agt tat ctg aga gag aaa gag atg gaa gag gaa agc 3924 Ile Gln Glu Ser Tyr Leu Arg Glu Lys Glu Met Glu Glu Glu Ser 1295 1300 1305 ctt atg cag aga cgt gaa gaa gaa ggg aga ggt aaa gaa aga agg 3969 Leu Met Gln Arg Arg Glu Glu Glu Gly Arg Gly Lys Glu Arg Arg 1310 1315 1320 aaa gct ctt ttg gag aag gaa gag cgc aaa tgg aag gaa att gaa 4014 Lys Ala Leu Leu Glu Lys Glu Glu Arg Lys Trp Lys Glu Ile Glu 
1325 1330 1335 gcg tcc ctt att cca tct att cct aat gct ggt agc agg gag gca 4059 Ala Ser Leu Ile Pro Ser Ile Pro Asn Ala Gly Ser Arg Glu Ala 1340 1345 1350 gca gcc atg gca gct gca ata cgt gct gtt ggg ggt gat tct gtc 4104 Ala Ala Met Ala Ala Ala Ile Arg Ala Val Gly Gly Asp Ser Val 1355 1360 1365 ctt gag gat tcc ttc gca aga gag agg gtc tcg ggt att gca cgt 4149 Leu Glu Asp Ser Phe Ala Arg Glu Arg Val Ser Gly Ile Ala Arg 1370 1375 1380 agg ata cgc act gct caa cta gaa cga cgt gca caa cag act gga 4194 Arg Ile Arg Thr Ala Gln Leu Glu Arg Arg Ala Gln Gln Thr Gly 1385 1390 1395 ata tct ggg gca gtt tgt gtt ctt gat gat gaa cca atg ata agt 4239 Ile Ser Gly Ala Val Cys Val Leu Asp Asp Glu Pro Met Ile Ser 1400 1405 1410 ggt aaa cat tgc ggc caa atg gac tca agt gtc tgt caa agt cag 4284 Gly Lys His Cys Gly Gln Met Asp Ser Ser Val Cys Gln Ser Gln 1415 1420 1425 aag att agc ttt tcc gtt aca gca atg atc caa tcc gat tct gga 4329 Lys Ile Ser Phe Ser Val Thr Ala Met Ile Gln Ser Asp Ser Gly 1430 1435 1440 cct gta tgt ctt ttt ggc act gaa ttt caa aag aaa gta tgt tgg 4374 Pro Val Cys Leu Phe Gly Thr Glu Phe Gln Lys Lys Val Cys Trp 1445 1450 1455 gag att ctg gtt gct ggt tct gag caa gga att gag gct ggc caa 4419 Glu Ile Leu Val Ala Gly Ser Glu Gln Gly Ile Glu Ala Gly Gln 1460 1465 1470 gtt ggg ctt agg ttg ata aca aaa ggt gag agg cag aca acc gtt 4464 Val Gly Leu Arg Leu Ile Thr Lys Gly Glu Arg Gln Thr Thr Val 1475 1480 1485 gct aga gag tgg tat att ggt gca acc agc ata act gat gga agg 4509 Ala Arg Glu Trp Tyr Ile Gly Ala Thr Ser Ile Thr Asp Gly Arg 1490 1495 1500 tgg cat aca gtg aca atc aca att gat gct gat gcg ggg gaa gct 4554 Trp His Thr Val Thr Ile Thr Ile Asp Ala Asp Ala Gly Glu Ala 1505 1510 1515 act tgt tac ata gat ggt ggg ttt gat ggc tac cag aat ggg tta 4599 Thr Cys Tyr Ile Asp Gly Gly Phe Asp Gly Tyr Gln Asn Gly Leu 1520 1525 1530 cct cta agt att ggc agt gcc att tgg gaa caa gga gct gaa gtt 4644 Pro Leu Ser Ile Gly Ser Ala Ile Trp Glu Gln Gly Ala Glu Val 1535 1540 1545 tgg ttg ggt gtt agg cca 
cct ata gat gtt gat gca ttc ggg aga 4689 Trp Leu Gly Val Arg Pro Pro Ile Asp Val Asp Ala Phe Gly Arg 1550 1555 1560 tca gat agt gat ggc gtc gaa tca aag atg cat att atg gat gtt 4734 Ser Asp Ser Asp Gly Val Glu Ser Lys Met His Ile Met Asp Val 1565 1570 1575 ttc ctt tgg ggg aaa tgc tta agt gaa gaa gag gcc gct tct ttg 4779 Phe Leu Trp Gly Lys Cys Leu Ser Glu Glu Glu Ala Ala Ser Leu 1580 1585 1590 cat gca gcc att ggc atg gct gac tta gac atg att gat ttg tct 4824 His Ala Ala Ile Gly Met Ala Asp Leu Asp Met Ile Asp Leu Ser 1595 1600 1605 gat gac aat tgg caa tgg acg gat tca ccc ccc aga gtc gat ggt 4869 Asp Asp Asn Trp Gln Trp Thr Asp Ser Pro Pro Arg Val Asp Gly 1610 1615 1620 tgg gat agt gat cct gcc gat gtt gat ctc tat gat agg gat gac 4914 Trp Asp Ser Asp Pro Ala Asp Val Asp Leu Tyr Asp Arg Asp Asp 1625 1630 1635 gta gat tgg gat gga caa tat tcc agt ggg agg aaa aga aga tca 4959 Val Asp Trp Asp Gly Gln Tyr Ser Ser Gly Arg Lys Arg Arg Ser 1640 1645 1650 ggt cgg gat ttt gta atg agt gtc gat tcc ttt gcc agg aga cac 5004 Gly Arg Asp Phe Val Met Ser Val Asp Ser Phe Ala Arg Arg His 1655 1660 1665 agg aaa ccc agg atg gag aca caa gaa gat ata aat caa aga atg 5049 Arg Lys Pro Arg Met Glu Thr Gln Glu Asp Ile Asn Gln Arg Met 1670 1675 1680 cgt tca gtt gag ttg gct gtc aaa gaa gct ctc tct gca cga ggt 5094 Arg Ser Val Glu Leu Ala Val Lys Glu Ala Leu Ser Ala Arg Gly 1685 1690 1695 gat aag caa ttt act gac cag gaa ttt cct cca aat gat cgc tct 5139 Asp Lys Gln Phe Thr Asp Gln Glu Phe Pro Pro Asn Asp Arg Ser 1700 1705 1710 tta ttt gtg gat aca caa aat ccc cca tca aaa ttg cag gtt gtt 5184 Leu Phe Val Asp Thr Gln Asn Pro Pro Ser Lys Leu Gln Val Val 1715 1720 1725 tct gaa tgg atg aga cct gac tcc att gtg aaa gaa aac ggt agt 5229 Ser Glu Trp Met Arg Pro Asp Ser Ile Val Lys Glu Asn Gly Ser 1730 1735 1740 gat tcc cgt ccc tgc ctg ttc tct ggg gat gca aat cct tca gat 5274 Asp Ser Arg Pro Cys Leu Phe Ser Gly Asp Ala Asn Pro Ser Asp 1745 1750 1755 gtt tgc cag ggg cgt ttg ggg gat tgt tgg ttc tta agc gcc gtt 
5319 Val Cys Gln Gly Arg Leu Gly Asp Cys Trp Phe Leu Ser Ala Val 1760 1765 1770 gca gtt ttg aca gag gtt tca cga ata tct gaa gtg atc att act 5364 Ala Val Leu Thr Glu Val Ser Arg Ile Ser Glu Val Ile Ile Thr 1775 1780 1785 cct gaa tac aac gag gaa ggg atc tac act gtt cgt ttt tgt att 5409 Pro Glu Tyr Asn Glu Glu Gly Ile Tyr Thr Val Arg Phe Cys Ile 1790 1795 1800 cag ggt gag tgg gtt cct gtt gtt atc gat gac tgg att cca tgt 5454 Gln Gly Glu Trp Val Pro Val Val Ile Asp Asp Trp Ile Pro Cys 1805 1810 1815 gaa tca cct ggt aaa cca gct ttt gct act agc aga aag ctc aat 5499 Glu Ser Pro Gly Lys Pro Ala Phe Ala Thr Ser Arg Lys Leu Asn 1820 1825 1830 gaa ctc tgg gtc tcc atg gtg gag aaa gca tat gcc aag ctc cat 5544 Glu Leu Trp Val Ser Met Val Glu Lys Ala Tyr Ala Lys Leu His 1835 1840 1845 ggt tct tat gag gca ctg gag ggg gga ctg gtt cag gat gct ctt 5589 Gly Ser Tyr Glu Ala Leu Glu Gly Gly Leu Val Gln Asp Ala Leu 1850 1855 1860 gtc gac cta act gga gga gct ggt gag gag att gac ttg cgg agt 5634 Val Asp Leu Thr Gly Gly Ala Gly Glu Glu Ile Asp Leu Arg Ser 1865 1870 1875 gct caa gca caa ata gat ctt gca agt ggc aga ttg tgg tct caa 5679 Ala Gln Ala Gln Ile Asp Leu Ala Ser Gly Arg Leu Trp Ser Gln 1880 1885 1890 ttg tta cgt ttt aaa caa gag ggg ttc tta ctt ggt gct gga agt 5724 Leu Leu Arg Phe Lys Gln Glu Gly Phe Leu Leu Gly Ala Gly Ser 1895 1900 1905 cca tca gga tct gat gtt cat gta tct tcc agt ggc att gtg caa 5769 Pro Ser Gly Ser Asp Val His Val Ser Ser Ser Gly Ile Val Gln 1910 1915 1920 ggg cat gct tac tcc gtc tta cag gtg aga gag gtt gat ggg cac 5814 Gly His Ala Tyr Ser Val Leu Gln Val Arg Glu Val Asp Gly His 1925 1930 1935 aga ctt gtt cag att cga aat cca tgg gct aat gaa gtt gag tgg 5859 Arg Leu Val Gln Ile Arg Asn Pro Trp Ala Asn Glu Val Glu Trp 1940 1945 1950 aat ggt ccc tgg tca gac tca tcc cca gag tgg act gat agg atg 5904 Asn Gly Pro Trp Ser Asp Ser Ser Pro Glu Trp Thr Asp Arg Met 1955 1960 1965 aag cac aag ctg aag cat gtt cca cag tca aaa gaa ggt ata ttc 5949 Lys His Lys Leu Lys His Val Pro 
Gln Ser Lys Glu Gly Ile Phe 1970 1975 1980 tgg atg tct tgg caa gat ttc cag att cat ttc aga tca ata tat 5994 Trp Met Ser Trp Gln Asp Phe Gln Ile His Phe Arg Ser Ile Tyr 1985 1990 1995 gtg tgt cgg gtt tac ccc cgt gag atg cgc tac tct gta aat ggc 6039 Val Cys Arg Val Tyr Pro Arg Glu Met Arg Tyr Ser Val Asn Gly 2000 2005 2010 caa tgg cga ggt tat agt gcc ggt ggc tgc caa gat tat agc tca 6084 Gln Trp Arg Gly Tyr Ser Ala Gly Gly Cys Gln Asp Tyr Ser Ser 2015 2020 2025 tgg cat caa aat cca caa ttc agg ctg agg gca act ggt tct gat 6129 Trp His Gln Asn Pro Gln Phe Arg Leu Arg Ala Thr Gly Ser Asp 2030 2035 2040 gca tct tta cca att cat gtg ttc atc acc tta act cag ggc gta 6174 Ala Ser Leu Pro Ile His Val Phe Ile Thr Leu Thr Gln Gly Val 2045 2050 2055 ggt ttc tcg aga aca act cct gga ttt cgt aac tac caa tca agc 6219 Gly Phe Ser Arg Thr Thr Pro Gly Phe Arg Asn Tyr Gln Ser Ser 2060 2065 2070 cat gat tca cag ttg ttc tat atc gga ttg agg att ctt aaa act 6264 His Asp Ser Gln Leu Phe Tyr Ile Gly Leu Arg Ile Leu Lys Thr 2075 2080 2085 cgt gga cgt cgt gct gct tac aac ata ttt ctt cat gaa tct gtt 6309 Arg Gly Arg Arg Ala Ala Tyr Asn Ile Phe Leu His Glu Ser Val 2090 2095 2100 ggt gga aca gac tat gtg aat tcc cgt gag att tca tgt gaa atg 6354 Gly Gly Thr Asp Tyr Val Asn Ser Arg Glu Ile Ser Cys Glu Met 2105 2110 2115 gtt ctt gac cct gat cct aag ggt tat act att gtc cca acc acg 6399 Val Leu Asp Pro Asp Pro Lys Gly Tyr Thr Ile Val Pro Thr Thr 2120 2125 2130 ata cac cca ggg gaa gaa gca cct ttt gtc ctt tca gtc ttc aca 6444 Ile His Pro Gly Glu Glu Ala Pro Phe Val Leu Ser Val Phe Thr 2135 2140 2145 aaa gca tcc att gtt ctt gaa gct ttg tag 6474 Lys Ala Ser Ile Val Leu Glu Ala Leu 2150 2155 16 2157 PRT Arabidopsis thaliana misc_feature (105)..(105) The 'Xaa' at location 105 stands for Ala or Thr. 
16 Met Glu Gly Asp Glu Arg Gly Val Leu Leu Ala Cys Val Ile Ser Gly 1 5 10 15 Thr Leu Phe Thr Val Phe Gly Ser Gly Ser Phe Trp Ile Leu Trp Ala 20 25 30 Val Asn Trp Arg Pro Trp Arg Leu Tyr Ser Trp Ile Phe Ala Arg Lys 35 40 45 Trp Pro Lys Val Leu Gln Gly Pro Gln Leu Asp Ile Leu Cys Gly Val 50 55 60 Leu Ser Leu Phe Ala Trp Ile Val Val Val Ser Pro Ile Ala Ile Leu 65 70 75 80 Ile Gly Trp Gly Ser Trp Leu Ile Val Ile Leu Asp Arg His Ile Ile 85 90 95 Gly Leu Ala Ile Ile Met Ala Gly Xaa Ala Leu Leu Leu Ala Phe Tyr 100 105 110 Ser Ile Met Leu Trp Trp Arg Thr Gln Trp Gln Ser Ser Arg Ala Val 115 120 125 Ala Leu Leu Leu Leu Leu Gly Val Ala Leu Leu Cys Ala Tyr Glu Leu 130 135 140 Cys Ala Val Tyr Val Thr Ala Gly Ala His Ala Ser Gln Gln Tyr Ser 145 150 155 160 Pro Ser Gly Phe Phe Phe Gly Val Ser Ala Ile Ala Leu Ala Ile Asn 165 170 175 Met Leu Phe Ile Cys Arg Met Val Phe Asn Gly Asn Gly Leu Asp Val 180 185 190 Asp Glu Tyr Val Arg Arg Ala Tyr Lys Phe Ala Tyr Ser Asp Cys Ile 195 200 205 Glu Val Gly Pro Val Ala Cys Leu Pro Glu Pro Pro Asp Pro Asn Glu 210 215 220 Leu Tyr Pro Arg Gln Thr Ser Arg Ala Ser His Leu Gly Leu Leu Tyr 225 230 235 240 Leu Gly Ser Leu Val Val Leu Leu Ala Tyr Ser Val Leu Tyr Gly Leu 245 250 255 Thr Ala Arg Glu Ser Arg Trp Leu Gly Gly Ile Thr Ser Ala Ala Val 260 265 270 Ile Val Leu Asp Trp Asn Ile Gly Ala Cys Leu Tyr Gly Phe Lys Leu 275 280 285 Leu Gln Asn Arg Val Leu Ala Leu Phe Val Ala Gly Ile Ser Arg Leu 290 295 300 Phe Leu Ile Cys Phe Gly Ile His Tyr Trp Tyr Leu Gly His Cys Ile 305 310 315 320 Ser Tyr Ile Phe Val Ala Ser Val Leu Ser Gly Ala Ala Val Ser Arg 325 330 335 His Leu Ser Ile Thr Asp Pro Ser Ala Ala Arg Arg Asp Ala Leu Gln 340 345 350 Ser Thr Val Ile Arg Leu Arg Glu Gly Phe Arg Arg Lys Glu Gln Asn 355 360 365 Ser Ser Ser Ser Ser Ser Asp Gly Cys Gly Ser Ser Ile Lys Arg Ser 370 375 380 Ser Ser Ile Asp Ala Gly His Thr Gly Cys Thr Asn Glu Ala Asn Arg 385 390 395 400 Thr Ala Glu Ser Cys Thr Ala Asp Asn Leu Thr Arg Thr Gly Ser Ser 405 410 415 Gln Glu Gly Ile 
Asn Ser Asp Lys Ser Glu Glu Ser Gly Arg Pro Ser 420 425 430 Leu Gly Leu Arg Ser Ser Ser Cys Arg Ser Val Val Gln Glu Pro Glu 435 440 445 Ala Gly Thr Ser Tyr Phe Met Asp Lys Val Ser Asp Gln Asn Asn Thr 450 455 460 Leu Val Val Cys Ser Ser Ser Gly Leu Asp Ser Gln Gly Tyr Glu Ser 465 470 475 480 Ser Thr Ser Asn Ser Ala Asn Gln Gln Leu Leu Asp Met Asn Leu Ala 485 490 495 Leu Ala Phe Gln Asp Gln Leu Asn Asn Pro Arg Ile Ala Ser Ile Leu 500 505 510 Lys Lys Lys Xaa Lys Glu Gly Asp Leu Glu Leu Thr Asn Leu Leu Gln 515 520 525 Asp Lys Gly Leu Asp Pro Asn Phe Ala Val Met Leu Lys Glu Lys Asn 530 535 540 Leu Asp Pro Gly Lys Asn Leu Gly Xaa Thr Ile Leu Ala Leu Leu Gln 545 550 555 560 Arg Ser Ser Leu Asp Ala Asp Arg Asp His Arg Asp Asn Thr Asp Ile 565 570 575 Thr Ile Ile Asp Ser Asn Ser Val Asp Asn Thr Leu Pro Asn Gln Ile 580 585 590 Ser Leu Ser Glu Glu Leu Arg Leu Arg Gly Leu Glu Lys Trp Leu Lys 595 600 605 Leu Ser Arg Leu Leu Leu His His Val Ala Gly Thr Pro Glu Arg Ala 610 615 620 Trp Gly Leu Phe Ser Leu Val Phe Ile Leu Glu Thr Ile Ile Val Ala 625 630 635 640 Ile Phe Arg Pro Lys Thr Ile Thr Ile Ile Asn Ser Ser His Gln Gln 645 650 655 Phe Glu Phe Gly Phe Ser Val Leu Leu Leu Ser Pro Val Val Cys Ser 660 665 670 Ile Met Ala Phe Leu Arg Ser Leu Gln Val Glu Glu Met Ala Leu Thr 675 680 685 Ser Lys Ser Arg Lys Tyr Gly Phe Val Ala Trp Leu Leu Ser Thr Ser 690 695 700 Val Gly Leu Ser Leu Ser Phe Leu Ser Lys Ser Ser Val Leu Leu Gly 705 710 715 720 Ile Ser Leu Thr Val Pro Leu Met Ala Ala Cys Leu Ser Ile Ala Val 725 730 735 Pro Ile Trp Met His Asn Gly Tyr Gln Phe Trp Val Pro Gln Leu Ser 740 745 750 Cys Gly Asp Gln Ala Arg Asp Leu Arg Ser Pro Arg Ile Lys Gly Phe 755 760 765 Ile Leu Trp Ile Cys Val Val Leu Phe Ala Gly Ser Val Ile Ser Leu 770 775 780 Gly Ala Ile Ile Ser Ala Lys Pro Leu Asp Asp Leu Lys Tyr Lys Leu 785 790 795 800 Phe Ser Ala Arg Glu Asn Asn Val Thr Ser Pro Tyr Thr Ser Ser Val 805 810 815 Tyr Leu Gly Trp Ala Met Ser Ser Gly Ile Ala Leu Val Val Thr Ala 820 825 830 Ile Leu Pro Ile Val 
Ser Trp Phe Ala Thr Tyr Arg Phe Ser His Ser 835 840 845 Ser Ala Val Cys Leu Met Ile Phe Ser Val Val Leu Val Ala Phe Cys 850 855 860 Gly Thr Ser Tyr Leu Glu Val Val Lys Ser Arg Asp Asp Gln Leu Pro 865 870 875 880 Thr Lys Gly Asp Phe Leu Ala Ala Leu Leu Pro Leu Ala Cys Ile Pro 885 890 895 Ala Leu Leu Ser Leu Cys Cys Gly Met Val Lys Trp Lys Asp Asp Cys 900 905 910 Trp Ile Leu Ser Arg Gly Val Tyr Val Phe Phe Ser Ile Gly Leu Leu 915 920 925 Leu Leu Phe Gly Ala Ile Ala Ala Val Ile Ala Val Lys Pro Trp Thr 930 935 940 Ile Gly Val Ser Phe Leu Leu Val Leu Phe Leu Met Val Val Thr Ile 945 950 955 960 Gly Val Ile His Leu Trp Ala Ser Asn Asn Phe Tyr Leu Thr Arg Lys 965 970 975 Gln Thr Ser Phe Val Cys Phe Leu Ala Leu Leu Leu Gly Leu Ala Ala 980 985 990 Phe Leu Leu Gly Trp His Gln Asp Lys Ala Phe Ala Gly Ala Ser Val 995 1000 1005 Gly Tyr Phe Thr Phe Leu Ser Leu Leu Ala Gly Arg Ala Leu Ala 1010 1015 1020 Val Leu Leu Ser Pro Pro Ile Val Val Tyr Ser Pro Arg Val Leu 1025 1030 1035 Pro Val Tyr Val Tyr Asp Ala His Ala Asp Cys Gly Lys Asn Val 1040 1045 1050 Ser Ala Ala Phe Leu Val Leu Tyr Gly Ile Ala Leu Ala Thr Glu 1055 1060 1065 Gly Trp Gly Val Val Ala Ser Leu Ile Ile Tyr Pro Pro Phe Ala 1070 1075 1080 Gly Ala Ala Val Ser Ala Ile Thr Leu Val Val Ala Phe Gly Phe 1085 1090 1095 Ala Val Ser Arg Pro Cys Leu Thr Leu Glu Met Met Glu Val Ala 1100 1105 1110 Val Arg Phe Leu Ser Lys Asp Thr Ile Val Gln Ala Ile Ser Arg 1115 1120 1125 Ser Ala Thr Lys Thr Arg Asn Ala Leu Ser Gly Thr Tyr Ser Ala 1130 1135 1140 Pro Gln Arg Ser Ala Ser Ser Ala Ala Leu Leu Val Gly Asp Pro 1145 1150 1155 Ser Ala Met Arg Asp Lys Ala Gly Asn Phe Val Leu Pro Arg Asp 1160 1165 1170 Asp Val Met Lys Leu Arg Asp Arg Leu Arg Asn Glu Glu Arg Val 1175 1180 1185 Ala Gly Ser Ile Phe Tyr Lys Met Gln Cys Arg Lys Gly Phe Arg 1190 1195 1200 His Glu Pro Pro Thr Asn Val Asp Tyr Arg Arg Asp Met Cys Ala 1205 1210 1215 His Ala Arg Val Leu Ala Leu Glu Glu Ala Ile Asp Thr Glu Trp 1220 1225 1230 Val Tyr Met Trp Asp Lys Phe Gly Gly Tyr Leu Leu 
Leu Leu Leu 1235 1240 1245 Gly Leu Thr Ala Lys Ala Glu Arg Val Gln Asp Glu Val Arg Leu 1250 1255 1260 Arg Leu Phe Leu Asp Ser Ile Gly Phe Ser Asp Leu Ser Ala Arg 1265 1270 1275 Lys Ile Ser Lys Trp Lys Pro Glu Asp Arg Arg Gln Phe Glu Ile 1280 1285 1290 Ile Gln Glu Ser Tyr Leu Arg Glu Lys Glu Met Glu Glu Glu Ser 1295 1300 1305 Leu Met Gln Arg Arg Glu Glu Glu Gly Arg Gly Lys Glu Arg Arg 1310 1315 1320 Lys Ala Leu Leu Glu Lys Glu Glu Arg Lys Trp Lys Glu Ile Glu 1325 1330 1335 Ala Ser Leu Ile Pro Ser Ile Pro Asn Ala Gly Ser Arg Glu Ala 1340 1345 1350 Ala Ala Met Ala Ala Ala Ile Arg Ala Val Gly Gly Asp Ser Val 1355 1360 1365 Leu Glu Asp Ser Phe Ala Arg Glu Arg Val Ser Gly Ile Ala Arg 1370 1375 1380 Arg Ile Arg Thr Ala Gln Leu Glu Arg Arg Ala Gln Gln Thr Gly 1385 1390 1395 Ile Ser Gly Ala Val Cys Val Leu Asp Asp Glu Pro Met Ile Ser 1400 1405 1410 Gly Lys His Cys Gly Gln Met Asp Ser Ser Val Cys Gln Ser Gln 1415 1420 1425 Lys Ile Ser Phe Ser Val Thr Ala Met Ile Gln Ser Asp Ser Gly 1430 1435 1440 Pro Val Cys Leu Phe Gly Thr Glu Phe Gln Lys Lys Val Cys Trp 1445 1450 1455 Glu Ile Leu Val Ala Gly Ser Glu Gln Gly Ile Glu Ala Gly Gln 1460 1465 1470 Val Gly Leu Arg Leu Ile Thr Lys Gly Glu Arg Gln Thr Thr Val 1475 1480 1485 Ala Arg Glu Trp Tyr Ile Gly Ala Thr Ser Ile Thr Asp Gly Arg 1490 1495 1500 Trp His Thr Val Thr Ile Thr Ile Asp Ala Asp Ala Gly Glu Ala 1505 1510 1515 Thr Cys Tyr Ile Asp Gly Gly Phe Asp Gly Tyr Gln Asn Gly Leu 1520 1525 1530 Pro Leu Ser Ile Gly Ser Ala Ile Trp Glu Gln Gly Ala Glu Val 1535 1540 1545 Trp Leu Gly Val Arg Pro Pro Ile Asp Val Asp Ala Phe Gly Arg 1550 1555 1560 Ser Asp Ser Asp Gly Val Glu Ser Lys Met His Ile Met Asp Val 1565 1570 1575 Phe Leu Trp Gly Lys Cys Leu Ser Glu Glu Glu Ala Ala Ser Leu 1580 1585 1590 His Ala Ala Ile Gly Met Ala Asp Leu Asp Met Ile Asp Leu Ser 1595 1600 1605 Asp Asp Asn Trp Gln Trp Thr Asp Ser Pro Pro Arg Val Asp Gly 1610 1615 1620 Trp Asp Ser Asp Pro Ala Asp Val Asp Leu Tyr Asp Arg Asp Asp 1625 1630 1635 Val Asp Trp Asp Gly 
Gln Tyr Ser Ser Gly Arg Lys Arg Arg Ser 1640 1645 1650 Gly Arg Asp Phe Val Met Ser Val Asp Ser Phe Ala Arg Arg His 1655 1660 1665 Arg Lys Pro Arg Met Glu Thr Gln Glu Asp Ile Asn Gln Arg Met 1670 1675 1680 Arg Ser Val Glu Leu Ala Val Lys Glu Ala Leu Ser Ala Arg Gly 1685 1690 1695 Asp Lys Gln Phe Thr Asp Gln Glu Phe Pro Pro Asn Asp Arg Ser 1700 1705 1710 Leu Phe Val Asp Thr Gln Asn Pro Pro Ser Lys Leu Gln Val Val 1715 1720 1725 Ser Glu Trp Met Arg Pro Asp Ser Ile Val Lys Glu Asn Gly Ser 1730 1735 1740 Asp Ser Arg Pro Cys Leu Phe Ser Gly Asp Ala Asn Pro Ser Asp 1745 1750 1755 Val Cys Gln Gly Arg Leu Gly Asp Cys Trp Phe Leu Ser Ala Val 1760 1765 1770 Ala Val Leu Thr Glu Val Ser Arg Ile Ser Glu Val Ile Ile Thr 1775 1780 1785 Pro Glu Tyr Asn Glu Glu Gly Ile Tyr Thr Val Arg Phe Cys Ile 1790 1795 1800 Gln Gly Glu Trp Val Pro Val Val Ile Asp Asp Trp Ile Pro Cys 1805 1810 1815 Glu Ser Pro Gly Lys Pro Ala Phe Ala Thr Ser Arg Lys Leu Asn 1820 1825 1830 Glu Leu Trp Val Ser Met Val Glu Lys Ala Tyr Ala Lys Leu His 1835 1840 1845 Gly Ser Tyr Glu Ala Leu Glu Gly Gly Leu Val Gln Asp Ala Leu 1850 1855 1860 Val Asp Leu Thr Gly Gly Ala Gly Glu Glu Ile Asp Leu Arg Ser 1865 1870 1875 Ala Gln Ala Gln Ile Asp Leu Ala Ser Gly Arg Leu Trp Ser Gln 1880 1885 1890 Leu Leu Arg Phe Lys Gln Glu Gly Phe Leu Leu Gly Ala Gly Ser 1895 1900 1905 Pro Ser Gly Ser Asp Val His Val Ser Ser Ser Gly Ile Val Gln 1910 1915 1920 Gly His Ala Tyr Ser Val Leu Gln Val Arg Glu Val Asp Gly His 1925 1930 1935 Arg Leu Val Gln Ile Arg Asn Pro Trp Ala Asn Glu Val Glu Trp 1940 1945 1950 Asn Gly Pro Trp Ser Asp Ser Ser Pro Glu Trp Thr Asp Arg Met 1955 1960 1965 Lys His Lys Leu Lys His Val Pro Gln Ser Lys Glu Gly Ile Phe 1970 1975 1980 Trp Met Ser Trp Gln Asp Phe Gln Ile His Phe Arg Ser Ile Tyr 1985 1990 1995 Val Cys Arg Val Tyr Pro Arg Glu Met Arg Tyr Ser Val Asn Gly 2000 2005 2010 Gln Trp Arg Gly Tyr Ser Ala Gly Gly Cys Gln Asp Tyr Ser Ser 2015 2020 2025 Trp His Gln Asn Pro Gln Phe Arg Leu Arg Ala Thr Gly Ser Asp 2030 2035 
2040 Ala Ser Leu Pro Ile His Val Phe Ile Thr Leu Thr Gln Gly Val 2045 2050 2055 Gly Phe Ser Arg Thr Thr Pro Gly Phe Arg Asn Tyr Gln Ser Ser 2060 2065 2070 His Asp Ser Gln Leu Phe Tyr Ile Gly Leu Arg Ile Leu Lys Thr 2075 2080 2085 Arg Gly Arg Arg Ala Ala Tyr Asn Ile Phe Leu His Glu Ser Val 2090 2095 2100 Gly Gly Thr Asp Tyr Val Asn Ser Arg Glu Ile Ser Cys Glu Met 2105 2110 2115 Val Leu Asp Pro Asp Pro Lys Gly Tyr Thr Ile Val Pro Thr Thr 2120 2125 2130 Ile His Pro Gly Glu Glu Ala Pro Phe Val Leu Ser Val Phe Thr 2135 2140 2145 Lys Ala Ser Ile Val Leu Glu Ala Leu 2150 2155 17 2058 DNA Arabidopsis thaliana CDS (1)..(2058) 17 atg gaa act gaa tct gat gac gcc acc att acc gtc gtc aaa gac atg 48 Met Glu Thr Glu Ser Asp Asp Ala Thr Ile Thr Val Val Lys Asp Met 1 5 10 15 aga gtt cga tta gag aat cga att cga acc caa cac gac gct cac tta 96 Arg Val Arg Leu Glu Asn Arg Ile Arg Thr Gln His Asp Ala His Leu 20 25 30 gat ctc ctt tcc tca ctc caa tcc att gtt ccc gat ata gtc cca tct 144 Asp Leu Leu Ser Ser Leu Gln Ser Ile Val Pro Asp Ile Val Pro Ser 35 40 45 ctc gat ctc tca ctc aaa ctc atc tct tct ttc acg aat cga cct ttc 192 Leu Asp Leu Ser Leu Lys Leu Ile Ser Ser Phe Thr Asn Arg Pro Phe 50 55 60 gta gct aca cca cca tta cca gaa ccc aaa gtt gag aag aag cat cat 240 Val Ala Thr Pro Pro Leu Pro Glu Pro Lys Val Glu Lys Lys His His 65 70 75 80 ccg att gtt aaa tta gga acc caa ctg caa caa tta cat ggg cat gat 288 Pro Ile Val Lys Leu Gly Thr Gln Leu Gln Gln Leu His Gly His Asp 85 90 95 tca aaa tcg atg ctt gtt gat tcg aat cag aga gat gct gaa gct gat 336 Ser Lys Ser Met Leu Val Asp Ser Asn Gln Arg Asp Ala Glu Ala Asp 100 105 110 ggg tct agt ggg agt cct atg gcg ctt gtt aga gct atg gtt gct gag 384 Gly Ser Ser Gly Ser Pro Met Ala Leu Val Arg Ala Met Val Ala Glu 115 120 125 tgt ttg ctt caa cgt gta ccg ttt tct cct acg gat tct tct act gtg 432 Cys Leu Leu Gln Arg Val Pro Phe Ser Pro Thr Asp Ser Ser Thr Val 130 135 140 ctt agg aag ctt gag aat gac cag aat gct aga ccg gcg gag aag gcg 480 Leu Arg Lys Leu Glu Asn 
Asp Gln Asn Ala Arg Pro Ala Glu Lys Ala 145 150 155 160 gcg tta cga gat tta ggc ggt gaa tgt gga ccg att ctt gcg gtg gag 528 Ala Leu Arg Asp Leu Gly Gly Glu Cys Gly Pro Ile Leu Ala Val Glu 165 170 175 acc gca ctt aaa tcg atg gcg gaa gag aat ggt tct gtg gaa ttg gaa 576 Thr Ala Leu Lys Ser Met Ala Glu Glu Asn Gly Ser Val Glu Leu Glu 180 185 190 gag ttt gag gtt agt ggg aag cct agg atc atg gtt tta gct ata gat 624 Glu Phe Glu Val Ser Gly Lys Pro Arg Ile Met Val Leu Ala Ile Asp 195 200 205 aga act aga ttg ctt aaa gag tta cca gag agt ttc caa ggg aat aac 672 Arg Thr Arg Leu Leu Lys Glu Leu Pro Glu Ser Phe Gln Gly Asn Asn 210 215 220 gaa tcg aat cgg gtt gtc gaa acg cct aat tca att gag aat gct act 720 Glu Ser Asn Arg Val Val Glu Thr Pro Asn Ser Ile Glu Asn Ala Thr 225 230 235 240 gtt tct ggt ggc ggg ttt ggt gtt tcc ggg tcg ggt aat ttc ccg aga 768 Val Ser Gly Gly Gly Phe Gly Val Ser Gly Ser Gly Asn Phe Pro Arg 245 250 255 cct gag atg tgg gga ggt gat cct aat atg ggg ttt agg cca atg atg 816 Pro Glu Met Trp Gly Gly Asp Pro Asn Met Gly Phe Arg Pro Met Met 260 265 270 aat gca cca aga gga atg caa atg atg gga atg cat cat cca atg ggg 864 Asn Ala Pro Arg Gly Met Gln Met Met Gly Met His His Pro Met Gly 275 280 285 ata atg ggt aga cca cca cca ttt cct ttg cct ttg ccg ttg cct gta 912 Ile Met Gly Arg Pro Pro Pro Phe Pro Leu Pro Leu Pro Leu Pro Val 290 295 300 ccg tcg aat cag aag ctt aga agt gag gaa gag gat ttg aaa gat gtt 960 Pro Ser Asn Gln Lys Leu Arg Ser Glu Glu Glu Asp Leu Lys Asp Val 305 310 315 320 gag gct ctt ttg agt aag aag tct ttt aag gag aag caa cag tct agg 1008 Glu Ala Leu Leu Ser Lys Lys Ser Phe Lys Glu Lys Gln Gln Ser Arg 325 330 335 aca ggt gag gag ttg ctt gat ctg att cac cgt cca acc gct aaa gaa 1056 Thr Gly Glu Glu Leu Leu Asp Leu Ile His Arg Pro Thr Ala Lys Glu 340 345 350 gcc gct act gct gct aag ttt aaa agc aaa gga gga tca cag gtt aag 1104 Ala Ala Thr Ala Ala Lys Phe Lys Ser Lys Gly Gly Ser Gln Val Lys 355 360 365 tat tac tgt agg tat tta act aaa gag gat tgc cgt ctt cag tct 
ggt 1152 Tyr Tyr Cys Arg Tyr Leu Thr Lys Glu Asp Cys Arg Leu Gln Ser Gly 370 375 380 tct cac ata gcc tgc aac aag aga cat ttc cgt cga cta atc gct tca 1200 Ser His Ile Ala Cys Asn Lys Arg His Phe Arg Arg Leu Ile Ala Ser 385 390 395 400 cat act gac gtt agc cta gga gat tgt tcc ttt ctc gat act tgt cgt 1248 His Thr Asp Val Ser Leu Gly Asp Cys Ser Phe Leu Asp Thr Cys Arg 405 410 415 cac atg aag act tgc aaa tac gtg cat tac gag ctc gac atg gct gac 1296 His Met Lys Thr Cys Lys Tyr Val His Tyr Glu Leu Asp Met Ala Asp 420 425 430 gct atg atg gct ggt cca gat aag gca ttg aag cct cta cgt gct gat 1344 Ala Met Met Ala Gly Pro Asp Lys Ala Leu Lys Pro Leu Arg Ala Asp 435 440 445 tac tgt tct gaa gct gaa ctt ggt gag gca caa tgg att aac tgt gac 1392 Tyr Cys Ser Glu Ala Glu Leu Gly Glu Ala Gln Trp Ile Asn Cys Asp 450 455 460 atc cgt tct ttt cga atg gac att ttg gga acc ttt gga gtt gtt atg 1440 Ile Arg Ser Phe Arg Met Asp Ile Leu Gly Thr Phe Gly Val Val Met 465 470 475 480 gcg gat cca cca tgg gac att cac atg gaa ctt cca tat gga aca atg 1488 Ala Asp Pro Pro Trp Asp Ile His Met Glu Leu Pro Tyr Gly Thr Met 485 490 495 gct gat gac gaa atg cga act ctt aat gtt ccc tcg tta cag act gac 1536 Ala Asp Asp Glu Met Arg Thr Leu Asn Val Pro Ser Leu Gln Thr Asp 500 505 510 ggt ttg att ttc ctc tgg gtc act ggt cgt gct atg gaa cta ggc cgt 1584 Gly Leu Ile Phe Leu Trp Val Thr Gly Arg Ala Met Glu Leu Gly Arg 515 520 525 gaa tgt ttg gag ctt tgg gga tac aag cga gtg gaa gag atc ata tgg 1632 Glu Cys Leu Glu Leu Trp Gly Tyr Lys Arg Val Glu Glu Ile Ile Trp 530 535 540 gta aag aca aat caa ctt caa cgt att att cga aca gga aga aca ggt 1680 Val Lys Thr Asn Gln Leu Gln Arg Ile Ile Arg Thr Gly Arg Thr Gly 545 550 555 560 cac tgg ctc aac cat agc aaa gag cat tgt ctg gtt gga atc aaa gga 1728 His Trp Leu Asn His Ser Lys Glu His Cys Leu Val Gly Ile Lys Gly 565 570 575 aat cca gaa gtg aac aga aac atc gac aca gat gtg att gtc gca gag 1776 Asn Pro Glu Val Asn Arg Asn Ile Asp Thr Asp Val Ile Val Ala Glu 580 585 590 gtt aga gaa aca 
agc cgg aag cca gac gag atg tat gca atg tta gag 1824 Val Arg Glu Thr Ser Arg Lys Pro Asp Glu Met Tyr Ala Met Leu Glu 595 600 605 agg atc atg cca aga gca aga aag ctc gag tta ttc gcc cgt atg cac 1872 Arg Ile Met Pro Arg Ala Arg Lys Leu Glu Leu Phe Ala Arg Met His 610 615 620 aat gct cat gcc gga tgg cta tca ctt ggg aac cag ctt aac ggt gtc 1920 Asn Ala His Ala Gly Trp Leu Ser Leu Gly Asn Gln Leu Asn Gly Val 625 630 635 640 cga tta ata aac gaa ggt ctt cga gct cgg ttt aaa gca tca tac cct 1968 Arg Leu Ile Asn Glu Gly Leu Arg Ala Arg Phe Lys Ala Ser Tyr Pro 645 650 655 gaa ata gat gtc cag cca cca tca cca cca aga gcc tct gca atg gaa 2016 Glu Ile Asp Val Gln Pro Pro Ser Pro Pro Arg Ala Ser Ala Met Glu 660 665 670 aca gac aac gaa cca atg gct att gac tca atc aca gct tag 2058 Thr Asp Asn Glu Pro Met Ala Ile Asp Ser Ile Thr Ala 675 680 685 18 685 PRT Arabidopsis thaliana 18 Met Glu Thr Glu Ser Asp Asp Ala Thr Ile Thr Val Val Lys Asp Met 1 5 10 15 Arg Val Arg Leu Glu Asn Arg Ile Arg Thr Gln His Asp Ala His Leu 20 25 30 Asp Leu Leu Ser Ser Leu Gln Ser Ile Val Pro Asp Ile Val Pro Ser 35 40 45 Leu Asp Leu Ser Leu Lys Leu Ile Ser Ser Phe Thr Asn Arg Pro Phe 50 55 60 Val Ala Thr Pro Pro Leu Pro Glu Pro Lys Val Glu Lys Lys His His 65 70 75 80 Pro Ile Val Lys Leu Gly Thr Gln Leu Gln Gln Leu His Gly His Asp 85 90 95 Ser Lys Ser Met Leu Val Asp Ser Asn Gln Arg Asp Ala Glu Ala Asp 100 105 110 Gly Ser Ser Gly Ser Pro Met Ala Leu Val Arg Ala Met Val Ala Glu 115 120 125 Cys Leu Leu Gln Arg Val Pro Phe Ser Pro Thr Asp Ser Ser Thr Val 130 135 140 Leu Arg Lys Leu Glu Asn Asp Gln Asn Ala Arg Pro Ala Glu Lys Ala 145 150 155 160 Ala Leu Arg Asp Leu Gly Gly Glu Cys Gly Pro Ile Leu Ala Val Glu 165 170 175 Thr Ala Leu Lys Ser Met Ala Glu Glu Asn Gly Ser Val Glu Leu Glu 180 185 190 Glu Phe Glu Val Ser Gly Lys Pro Arg Ile Met Val Leu Ala Ile Asp 195 200 205 Arg Thr Arg Leu Leu Lys Glu Leu Pro Glu Ser Phe Gln Gly Asn Asn 210 215 220 Glu Ser Asn Arg Val Val Glu Thr Pro Asn Ser Ile Glu Asn Ala Thr 225 230 
235 240 Val Ser Gly Gly Gly Phe Gly Val Ser Gly Ser Gly Asn Phe Pro Arg 245 250 255 Pro Glu Met Trp Gly Gly Asp Pro Asn Met Gly Phe Arg Pro Met Met 260 265 270 Asn Ala Pro Arg Gly Met Gln Met Met Gly Met His His Pro Met Gly 275 280 285 Ile Met Gly Arg Pro Pro Pro Phe Pro Leu Pro Leu Pro Leu Pro Val 290 295 300 Pro Ser Asn Gln Lys Leu Arg Ser Glu Glu Glu Asp Leu Lys Asp Val 305 310 315 320 Glu Ala Leu Leu Ser Lys Lys Ser Phe Lys Glu Lys Gln Gln Ser Arg 325 330 335 Thr Gly Glu Glu Leu Leu Asp Leu Ile His Arg Pro Thr Ala Lys Glu 340 345 350 Ala Ala Thr Ala Ala Lys Phe Lys Ser Lys Gly Gly Ser Gln Val Lys 355 360 365 Tyr Tyr Cys Arg Tyr Leu Thr Lys Glu Asp Cys Arg Leu Gln Ser Gly 370 375 380 Ser His Ile Ala Cys Asn Lys Arg His Phe Arg Arg Leu Ile Ala Ser 385 390 395 400 His Thr Asp Val Ser Leu Gly Asp Cys Ser Phe Leu Asp Thr Cys Arg 405 410 415 His Met Lys Thr Cys Lys Tyr Val His Tyr Glu Leu Asp Met Ala Asp 420 425 430 Ala Met Met Ala Gly Pro Asp Lys Ala Leu Lys Pro Leu Arg Ala Asp 435 440 445 Tyr Cys Ser Glu Ala Glu Leu Gly Glu Ala Gln Trp Ile Asn Cys Asp 450 455 460 Ile Arg Ser Phe Arg Met Asp Ile Leu Gly Thr Phe Gly Val Val Met 465 470 475 480 Ala Asp Pro Pro Trp Asp Ile His Met Glu Leu Pro Tyr Gly Thr Met 485 490 495 Ala Asp Asp Glu Met Arg Thr Leu Asn Val Pro Ser Leu Gln Thr Asp 500 505 510 Gly Leu Ile Phe Leu Trp Val Thr Gly Arg Ala Met Glu Leu Gly Arg 515 520 525 Glu Cys Leu Glu Leu Trp Gly Tyr Lys Arg Val Glu Glu Ile Ile Trp 530 535 540 Val Lys Thr Asn Gln Leu Gln Arg Ile Ile Arg Thr Gly Arg Thr Gly 545 550 555 560 His Trp Leu Asn His Ser Lys Glu His Cys Leu Val Gly Ile Lys Gly 565 570 575 Asn Pro Glu Val Asn Arg Asn Ile Asp Thr Asp Val Ile Val Ala Glu 580 585 590 Val Arg Glu Thr Ser Arg Lys Pro Asp Glu Met Tyr Ala Met Leu Glu 595 600 605 Arg Ile Met Pro Arg Ala Arg Lys Leu Glu Leu Phe Ala Arg Met His 610 615 620 Asn Ala His Ala Gly Trp Leu Ser Leu Gly Asn Gln Leu Asn Gly Val 625 630 635 640 Arg Leu Ile Asn Glu Gly Leu Arg Ala Arg Phe Lys Ala Ser Tyr Pro 645 650 
655 Glu Ile Asp Val Gln Pro Pro Ser Pro Pro Arg Ala Ser Ala Met Glu 660 665 670 Thr Asp Asn Glu Pro Met Ala Ile Asp Ser Ile Thr Ala 675 680 685 19 2397 DNA Arabidopsis thaliana CDS (1)..(2397) 19 atg gat gct tcg gtg gtg aga ttt tcc caa tcg ccg gca aga gtg cca 48 Met Asp Ala Ser Val Val Arg Phe Ser Gln Ser Pro Ala Arg Val Pro 1 5 10 15 ccg gaa ttt gaa cca gat atg gag aag att aaa cgg agg ctg ctc aag 96 Pro Glu Phe Glu Pro Asp Met Glu Lys Ile Lys Arg Arg Leu Leu Lys 20 25 30 tac ggt gtt gat cca acc ccc aaa atc ctg aac aat ctc cga aag aaa 144 Tyr Gly Val Asp Pro Thr Pro Lys Ile Leu Asn Asn Leu Arg Lys Lys 35 40 45 gaa att caa aaa cac aac cgt aga acc aag cgc gaa acc gag tcc gag 192 Glu Ile Gln Lys His Asn Arg Arg Thr Lys Arg Glu Thr Glu Ser Glu 50 55 60 gcg gag gtg tat acg gag gcg cag aaa caa tca atg gag gaa gaa gct 240 Ala Glu Val Tyr Thr Glu Ala Gln Lys Gln Ser Met Glu Glu Glu Ala 65 70 75 80 cgt ttt caa acc ctt aga cgg gaa tac aag caa ttc acg agg tca att 288 Arg Phe Gln Thr Leu Arg Arg Glu Tyr Lys Gln Phe Thr Arg Ser Ile 85 90 95 tct gga aaa agg ggc ggc gat gtt ggt ttg atg gtt ggg aat cca tgg 336 Ser Gly Lys Arg Gly Gly Asp Val Gly Leu Met Val Gly Asn Pro Trp 100 105 110 gaa gga atc gag aga gtg aag ctg aag gag ctc gtt agt ggc gtt cgg 384 Glu Gly Ile Glu Arg Val Lys Leu Lys Glu Leu Val Ser Gly Val Arg 115 120 125 aga gaa gag gtt agt gct ggt gaa ttg aag aaa gag aat cta aaa gag 432 Arg Glu Glu Val Ser Ala Gly Glu Leu Lys Lys Glu Asn Leu Lys Glu 130 135 140 ttg aag aag ata ctt gag aag gat ctt cgt tgg gtt cta gac gac gac 480 Leu Lys Lys Ile Leu Glu Lys Asp Leu Arg Trp Val Leu Asp Asp Asp 145 150 155 160 gtt gat gtg gaa gaa ttt gat ttg gac aaa gaa ttt gat cct gcg aaa 528 Val Asp Val Glu Glu Phe Asp Leu Asp Lys Glu Phe Asp Pro Ala Lys 165 170 175 cgg tgg cgt aac gaa gga gaa gca gtc aga gtt ctc gtt gac aga ttg 576 Arg Trp Arg Asn Glu Gly Glu Ala Val Arg Val Leu Val Asp Arg Leu 180 185 190 agt ggt aga gaa atc aat gag aag cat tgg aag ttt gtg aga atg atg 624 Ser Gly Arg Glu Ile 
Asn Glu Lys His Trp Lys Phe Val Arg Met Met 195 200 205 aat caa tca ggg ctt cag ttc act gaa gat cag atg ctt aag atc gtt 672 Asn Gln Ser Gly Leu Gln Phe Thr Glu Asp Gln Met Leu Lys Ile Val 210 215 220 gat cga ttg gga cgt aaa cag agc tgg aaa caa gct tca gct gtt gtt 720 Asp Arg Leu Gly Arg Lys Gln Ser Trp Lys Gln Ala Ser Ala Val Val 225 230 235 240 cat tgg gtg tat tct gat aaa aag cgt aaa cat ctt agg agc aga ttt 768 His Trp Val Tyr Ser Asp Lys Lys Arg Lys His Leu Arg Ser Arg Phe 245 250 255 gtt tac acc aag ctt ttg tcc gtt ctt ggg ttt gcg agg agg cca cag 816 Val Tyr Thr Lys Leu Leu Ser Val Leu Gly Phe Ala Arg Arg Pro Gln 260 265 270 gaa gct ctt cag ata ttc aat cag atg ctt ggt gat cgc cag tta tat 864 Glu Ala Leu Gln Ile Phe Asn Gln Met Leu Gly Asp Arg Gln Leu Tyr 275 280 285 cct gat atg gcg gcg tac cac tgt att gct gta aca ctt ggg caa gcg 912 Pro Asp Met Ala Ala Tyr His Cys Ile Ala Val Thr Leu Gly Gln Ala 290 295 300 ggt tta ttg aag gag ttg ctt aaa gta atc gag cgt atg agg cag aaa 960 Gly Leu Leu Lys Glu Leu Leu Lys Val Ile Glu Arg Met Arg Gln Lys 305 310 315 320 ccg act aaa cta act aag aat ttg cgg caa aag aac tgg gat cct gtg 1008 Pro Thr Lys Leu Thr Lys Asn Leu Arg Gln Lys Asn Trp Asp Pro Val 325 330 335 ctt gaa cct gac ttg gtt gta tac aac gct att ctt aac gct tgt gtt 1056 Leu Glu Pro Asp Leu Val Val Tyr Asn Ala Ile Leu Asn Ala Cys Val 340 345 350 cca aca ctc caa tgg aag gct gtt tca tgg gta ttt gta gag tta aga 1104 Pro Thr Leu Gln Trp Lys Ala Val Ser Trp Val Phe Val Glu Leu Arg 355 360 365 aaa aat ggt ttg agg cct aat gga gct aca tat ggg ctt gcg atg gag 1152 Lys Asn Gly Leu Arg Pro Asn Gly Ala Thr Tyr Gly Leu Ala Met Glu 370 375 380 gtt atg ctg gag tca ggg aag ttt gat cgt gtt cac gat ttt ttt agg 1200 Val Met Leu Glu Ser Gly Lys Phe Asp Arg Val His Asp Phe Phe Arg 385 390 395 400 aag atg aaa agc agt ggc gaa gct cca aaa gca atc aca tac aag gtt 1248 Lys Met Lys Ser Ser Gly Glu Ala Pro Lys Ala Ile Thr Tyr Lys Val 405 410 415 ctt gtc cga gct ctc tgg aga gaa ggc aag atc gaa gaa 
gct gtt gaa 1296 Leu Val Arg Ala Leu Trp Arg Glu Gly Lys Ile Glu Glu Ala Val Glu 420 425 430 gca gtc aga gat atg gaa caa aag gga gtt ata gga acg ggt tca gtt 1344 Ala Val Arg Asp Met Glu Gln Lys Gly Val Ile Gly Thr Gly Ser Val 435 440 445 tac tat gaa ttg gca tgc tgt ctg tgc aac aac ggg cgt tgg tgc gat 1392 Tyr Tyr Glu Leu Ala Cys Cys Leu Cys Asn Asn Gly Arg Trp Cys Asp 450 455 460 gca atg ctc gaa gtg ggg agg atg aaa aga ctt gaa aat tgc agg ccg 1440 Ala Met Leu Glu Val Gly Arg Met Lys Arg Leu Glu Asn Cys Arg Pro 465 470 475 480 ctc gag att acc ttc aca gga ctc ata gcc gct tca ttg aac ggt ggt 1488 Leu Glu Ile Thr Phe Thr Gly Leu Ile Ala Ala Ser Leu Asn Gly Gly 485 490 495 cat gtt gat gat tgc atg gct ata ttc caa tat atg aaa gat aaa tgt 1536 His Val Asp Asp Cys Met Ala Ile Phe Gln Tyr Met Lys Asp Lys Cys 500 505 510 gat ccg aat ata gga act gcg aac atg atg ctt aaa gtt tac gga agg 1584 Asp Pro Asn Ile Gly Thr Ala Asn Met Met Leu Lys Val Tyr Gly Arg 515 520 525 aat gat atg ttt tca gaa gct aaa gaa ttg ttt gaa gag att gtc agc 1632 Asn Asp Met Phe Ser Glu Ala Lys Glu Leu Phe Glu Glu Ile Val Ser 530 535 540 aga aaa gag act cat tta gtt cca aac gag tat aca tac agc ttc atg 1680 Arg Lys Glu Thr His Leu Val Pro Asn Glu Tyr Thr Tyr Ser Phe Met 545 550 555 560 ctt gaa gct tca gct aga tca ctg caa tgg gaa tac ttt gaa cat gtg 1728 Leu Glu Ala Ser Ala Arg Ser Leu Gln Trp Glu Tyr Phe Glu His Val 565 570 575 tat caa acg atg gtt ctt tct ggt tac caa atg gat caa aca aaa cat 1776 Tyr Gln Thr Met Val Leu Ser Gly Tyr Gln Met Asp Gln Thr Lys His 580 585 590 gca tca atg ctg ata gaa gca tca aga gcc ggg aag tgg agt ctt cta 1824 Ala Ser Met Leu Ile Glu Ala Ser Arg Ala Gly Lys Trp Ser Leu Leu 595 600 605 gag cat gcc ttc gat gca gta ctt gaa gat gga gaa atc cct cac cca 1872 Glu His Ala Phe Asp Ala Val Leu Glu Asp Gly Glu Ile Pro His Pro 610 615 620 ttg ttc ttc act gaa ctg ttg tgt cat gcc aca gct aaa ggc gat ttc 1920 Leu Phe Phe Thr Glu Leu Leu Cys His Ala Thr Ala Lys Gly Asp Phe 625 630 635 640 caa aga 
gct atc aca ctg att aac act gtg gct ctc gca tct ttc cag 1968 Gln Arg Ala Ile Thr Leu Ile Asn Thr Val Ala Leu Ala Ser Phe Gln 645 650 655 att agt gaa gaa gaa tgg act gat ctt ttt gag gag cat caa gac tgg 2016 Ile Ser Glu Glu Glu Trp Thr Asp Leu Phe Glu Glu His Gln Asp Trp 660 665 670 ctt act caa gat aat ctt cat aag ctc tcg gat cat ctc att gag tgt 2064 Leu Thr Gln Asp Asn Leu His Lys Leu Ser Asp His Leu Ile Glu Cys 675 680 685 gat tat gta agt gaa cca acg gtc tca aac ctc tca aag tca ttg aaa 2112 Asp Tyr Val Ser Glu Pro Thr Val Ser Asn Leu Ser Lys Ser Leu Lys 690 695 700 tct cga tgt ggg tct tct tct tct tca gca caa cca ttg tta gcc gta 2160 Ser Arg Cys Gly Ser Ser Ser Ser Ser Ala Gln Pro Leu Leu Ala Val 705 710 715 720 gat gta aca act caa agt cag ggc gaa aaa ccg gag gaa gat ttg ctt 2208 Asp Val Thr Thr Gln Ser Gln Gly Glu Lys Pro Glu Glu Asp Leu Leu 725 730 735 ctg caa gat aca aca atg gaa gat gat aat agt gct aat ggt gaa gct 2256 Leu Gln Asp Thr Thr Met Glu Asp Asp Asn Ser Ala Asn Gly Glu Ala 740 745 750 tgg gaa ttt aca gag act gaa ctc gaa acg ttg ggt cta gaa gaa ctc 2304 Trp Glu Phe Thr Glu Thr Glu Leu Glu Thr Leu Gly Leu Glu Glu Leu 755 760 765 gag att gat gat gat gaa gaa tct agc gac tct gat tca ctt tca gtt 2352 Glu Ile Asp Asp Asp Glu Glu Ser Ser Asp Ser Asp Ser Leu Ser Val 770 775 780 tat gac att ctg aaa gaa tgg gaa gag agt agt aag aaa gaa tga 2397 Tyr Asp Ile Leu Lys Glu Trp Glu Glu Ser Ser Lys Lys Glu 785 790 795 20 798 PRT Arabidopsis thaliana 20 Met Asp Ala Ser Val Val Arg Phe Ser Gln Ser Pro Ala Arg Val Pro 1 5 10 15 Pro Glu Phe Glu Pro Asp Met Glu Lys Ile Lys Arg Arg Leu Leu Lys 20 25 30 Tyr Gly Val Asp Pro Thr Pro Lys Ile Leu Asn Asn Leu Arg Lys Lys 35 40 45 Glu Ile Gln Lys His Asn Arg Arg Thr Lys Arg Glu Thr Glu Ser Glu 50 55 60 Ala Glu Val Tyr Thr Glu Ala Gln Lys Gln Ser Met Glu Glu Glu Ala 65 70 75 80 Arg Phe Gln Thr Leu Arg Arg Glu Tyr Lys Gln Phe Thr Arg Ser Ile 85 90 95 Ser Gly Lys Arg Gly Gly Asp Val Gly Leu Met Val Gly Asn Pro Trp 100 105 110 Glu Gly Ile 
Glu Arg Val Lys Leu Lys Glu Leu Val Ser Gly Val Arg 115 120 125 Arg Glu Glu Val Ser Ala Gly Glu Leu Lys Lys Glu Asn Leu Lys Glu 130 135 140 Leu Lys Lys Ile Leu Glu Lys Asp Leu Arg Trp Val Leu Asp Asp Asp 145 150 155 160 Val Asp Val Glu Glu Phe Asp Leu Asp Lys Glu Phe Asp Pro Ala Lys 165 170 175 Arg Trp Arg Asn Glu Gly Glu Ala Val Arg Val Leu Val Asp Arg Leu 180 185 190 Ser Gly Arg Glu Ile Asn Glu Lys His Trp Lys Phe Val Arg Met Met 195 200 205 Asn Gln Ser Gly Leu Gln Phe Thr Glu Asp Gln Met Leu Lys Ile Val 210 215 220 Asp Arg Leu Gly Arg Lys Gln Ser Trp Lys Gln Ala Ser Ala Val Val 225 230 235 240 His Trp Val Tyr Ser Asp Lys Lys Arg Lys His Leu Arg Ser Arg Phe 245 250 255 Val Tyr Thr Lys Leu Leu Ser Val Leu Gly Phe Ala Arg Arg Pro Gln 260 265 270 Glu Ala Leu Gln Ile Phe Asn Gln Met Leu Gly Asp Arg Gln Leu Tyr 275 280 285 Pro Asp Met Ala Ala Tyr His Cys Ile Ala Val Thr Leu Gly Gln Ala 290 295 300 Gly Leu Leu Lys Glu Leu Leu Lys Val Ile Glu Arg Met Arg Gln Lys 305 310 315 320 Pro Thr Lys Leu Thr Lys Asn Leu Arg Gln Lys Asn Trp Asp Pro Val 325 330 335 Leu Glu Pro Asp Leu Val Val Tyr Asn Ala Ile Leu Asn Ala Cys Val 340 345 350 Pro Thr Leu Gln Trp Lys Ala Val Ser Trp Val Phe Val Glu Leu Arg 355 360 365 Lys Asn Gly Leu Arg Pro Asn Gly Ala Thr Tyr Gly Leu Ala Met Glu 370 375 380 Val Met Leu Glu Ser Gly Lys Phe Asp Arg Val His Asp Phe Phe Arg 385 390 395 400 Lys Met Lys Ser Ser Gly Glu Ala Pro Lys Ala Ile Thr Tyr Lys Val 405 410 415 Leu Val Arg Ala Leu Trp Arg Glu Gly Lys Ile Glu Glu Ala Val Glu 420 425 430 Ala Val Arg Asp Met Glu Gln Lys Gly Val Ile Gly Thr Gly Ser Val 435 440 445 Tyr Tyr Glu Leu Ala Cys Cys Leu Cys Asn Asn Gly Arg Trp Cys Asp 450 455 460 Ala Met Leu Glu Val Gly Arg Met Lys Arg Leu Glu Asn Cys Arg Pro 465 470 475 480 Leu Glu Ile Thr Phe Thr Gly Leu Ile Ala Ala Ser Leu Asn Gly Gly 485 490 495 His Val Asp Asp Cys Met Ala Ile Phe Gln Tyr Met Lys Asp Lys Cys 500 505 510 Asp Pro Asn Ile Gly Thr Ala Asn Met Met Leu Lys Val Tyr Gly Arg 515 520 525 Asn Asp Met Phe 
Ser Glu Ala Lys Glu Leu Phe Glu Glu Ile Val Ser 530 535 540 Arg Lys Glu Thr His Leu Val Pro Asn Glu Tyr Thr Tyr Ser Phe Met 545 550 555 560 Leu Glu Ala Ser Ala Arg Ser Leu Gln Trp Glu Tyr Phe Glu His Val 565 570 575 Tyr Gln Thr Met Val Leu Ser Gly Tyr Gln Met Asp Gln Thr Lys His 580 585 590 Ala Ser Met Leu Ile Glu Ala Ser Arg Ala Gly Lys Trp Ser Leu Leu 595 600 605 Glu His Ala Phe Asp Ala Val Leu Glu Asp Gly Glu Ile Pro His Pro 610 615 620 Leu Phe Phe Thr Glu Leu Leu Cys His Ala Thr Ala Lys Gly Asp Phe 625 630 635 640 Gln Arg Ala Ile Thr Leu Ile Asn Thr Val Ala Leu Ala Ser Phe Gln 645 650 655 Ile Ser Glu Glu Glu Trp Thr Asp Leu Phe Glu Glu His Gln Asp Trp 660 665 670 Leu Thr Gln Asp Asn Leu His Lys Leu Ser Asp His Leu Ile Glu Cys 675 680 685 Asp Tyr Val Ser Glu Pro Thr Val Ser Asn Leu Ser Lys Ser Leu Lys 690 695 700 Ser Arg Cys Gly Ser Ser Ser Ser Ser Ala Gln Pro Leu Leu Ala Val 705 710 715 720 Asp Val Thr Thr Gln Ser Gln Gly Glu Lys Pro Glu Glu Asp Leu Leu 725 730 735 Leu Gln Asp Thr Thr Met Glu Asp Asp Asn Ser Ala Asn Gly Glu Ala 740 745 750 Trp Glu Phe Thr Glu Thr Glu Leu Glu Thr Leu Gly Leu Glu Glu Leu 755 760 765 Glu Ile Asp Asp Asp Glu Glu Ser Ser Asp Ser Asp Ser Leu Ser Val 770 775 780 Tyr Asp Ile Leu Lys Glu Trp Glu Glu Ser Ser Lys Lys Glu 785 790 795 21 2511 DNA Arabidopsis thaliana CDS (1)..(2511) 21 atg atc cgt gga aga acc gcg aaa gta att ccc aga aat gtc att ttt 48 Met Ile Arg Gly Arg Thr Ala Lys Val Ile Pro Arg Asn Val Ile Phe 1 5 10 15 acg ctt agt agg agt tcc att tct gag tct cca ttg att tct ccg agt 96 Thr Leu Ser Arg Ser Ser Ile Ser Glu Ser Pro Leu Ile Ser Pro Ser 20 25 30 cga ata aac cct aaa tta gcc ggt agt ttc tcg ttc aac atc cga tta 144 Arg Ile Asn Pro Lys Leu Ala Gly Ser Phe Ser Phe Asn Ile Arg Leu 35 40 45 ttg agc tat ttc act gtt aga aat ggc ttt tgt ccc gat tgt tct gta 192 Leu Ser Tyr Phe Thr Val Arg Asn Gly Phe Cys Pro Asp Cys Ser Val 50 55 60 ccg aga gac cct aat ttt gtt gga ttg act aca caa tgt cgg agt att 240 Pro Arg Asp Pro Asn Phe Val Gly Leu 
Thr Thr Gln Cys Arg Ser Ile 65 70 75 80 gtc agg agg ttt tgt agt gaa aag att gga agt agt gaa tct agt ggc 288 Val Arg Arg Phe Cys Ser Glu Lys Ile Gly Ser Ser Glu Ser Ser Gly 85 90 95 tgg aca gag gag gtt gag tat tta gat gag tcg ggt agt gta cta cac 336 Trp Thr Glu Glu Val Glu Tyr Leu Asp Glu Ser Gly Ser Val Leu His 100 105 110 agt ggt aag gga ata aga tca gta gag ccg ggg ctt gat gac cat gta 384 Ser Gly Lys Gly Ile Arg Ser Val Glu Pro Gly Leu Asp Asp His Val 115 120 125 atg gtt ggt gga ttg aag aag cct tac atg aat gct tct tct gtt gca 432 Met Val Gly Gly Leu Lys Lys Pro Tyr Met Asn Ala Ser Ser Val Ala 130 135 140 aag att gtt gaa gtt gtt cag agg tgg aaa tgg gga ccg gag ttg gag 480 Lys Ile Val Glu Val Val Gln Arg Trp Lys Trp Gly Pro Glu Leu Glu 145 150 155 160 act cag tta gat aaa ctc cag ttt gtt ccg aat atg gtt cat att acg 528 Thr Gln Leu Asp Lys Leu Gln Phe Val Pro Asn Met Val His Ile Thr 165 170 175 cag tct ttg aag att gtt aaa gag gtt gat gcg gcg ctg agt ctg ttt 576 Gln Ser Leu Lys Ile Val Lys Glu Val Asp Ala Ala Leu Ser Leu Phe 180 185 190 agg tgg gct aag aag cag cct tgg tat ctg cct tcg gat gaa tgt tat 624 Arg Trp Ala Lys Lys Gln Pro Trp Tyr Leu Pro Ser Asp Glu Cys Tyr 195 200 205 gtc gtg ttg ttt gat ggg ttg aac cag gga agg gat ttt gtt ggg att 672 Val Val Leu Phe Asp Gly Leu Asn Gln Gly Arg Asp Phe Val Gly Ile 210 215 220 cag tcg ttg ttt gag gag atg gtt caa gac tcg agt agt cac ggt gat 720 Gln Ser Leu Phe Glu Glu Met Val Gln Asp Ser Ser Ser His Gly Asp 225 230 235 240 ttg tcg ttt aac gcc tat aac caa gtg att cag tat ttg gct aaa gct 768 Leu Ser Phe Asn Ala Tyr Asn Gln Val Ile Gln Tyr Leu Ala Lys Ala 245 250 255 gag aaa ctg gag gtt gct ttt tgt tgt ttc aaa aag gct caa gag tcg 816 Glu Lys Leu Glu Val Ala Phe Cys Cys Phe Lys Lys Ala Gln Glu Ser 260 265 270 gga tgc aaa att gat acg cag aca tat aat aat cta atg atg ttg ttt 864 Gly Cys Lys Ile Asp Thr Gln Thr Tyr Asn Asn Leu Met Met Leu Phe 275 280 285 ctg aac aaa ggt ctc ccg tat aag gca ttt gag att tat gag agc atg 912 Leu Asn Lys 
Gly Leu Pro Tyr Lys Ala Phe Glu Ile Tyr Glu Ser Met 290 295 300 gag aaa act gat agt ttg ttg gat ggg tca act tat gag ctg ata att 960 Glu Lys Thr Asp Ser Leu Leu Asp Gly Ser Thr Tyr Glu Leu Ile Ile 305 310 315 320 cca agc ttg gcc aaa tcc ggc cgt ctt gat gca gct ttc aag ctt ttt 1008 Pro Ser Leu Ala Lys Ser Gly Arg Leu Asp Ala Ala Phe Lys Leu Phe 325 330 335 cag caa atg aaa gaa agg aaa ctc cga cca agc ttt agt gtg ttt tct 1056 Gln Gln Met Lys Glu Arg Lys Leu Arg Pro Ser Phe Ser Val Phe Ser 340 345 350 tcg ctt gtt gat tca atg ggg aaa gct ggt cga ttg gac aca tca atg 1104 Ser Leu Val Asp Ser Met Gly Lys Ala Gly Arg Leu Asp Thr Ser Met 355 360 365 aag gtt tac atg gaa atg cag gga ttt ggc cat agg cca tca gca act 1152 Lys Val Tyr Met Glu Met Gln Gly Phe Gly His Arg Pro Ser Ala Thr 370 375 380 atg ttt gtt tcc ttg att gat tca tat gct aaa gct ggt aag ctg gat 1200 Met Phe Val Ser Leu Ile Asp Ser Tyr Ala Lys Ala Gly Lys Leu Asp 385 390 395 400 act gct ctt agg ctt tgg gat gag atg aag aag tca ggt ttc agg cca 1248 Thr Ala Leu Arg Leu Trp Asp Glu Met Lys Lys Ser Gly Phe Arg Pro 405 410 415 aac ttt gga ttg tac aca atg atc att gaa tct cat gca aaa tca gga 1296 Asn Phe Gly Leu Tyr Thr Met Ile Ile Glu Ser His Ala Lys Ser Gly 420 425 430 aag ctt gaa gta gca atg acg gtt ttc aaa gac atg gag aaa gct ggg 1344 Lys Leu Glu Val Ala Met Thr Val Phe Lys Asp Met Glu Lys Ala Gly 435 440 445 ttt tta ccg aca cca tcc aca tat tcg tgt cta ttg gag atg cat gct 1392 Phe Leu Pro Thr Pro Ser Thr Tyr Ser Cys Leu Leu Glu Met His Ala 450 455 460 gga tct ggg caa gta gac tct gca atg aaa atc tat aac tcc atg act 1440 Gly Ser Gly Gln Val Asp Ser Ala Met Lys Ile Tyr Asn Ser Met Thr 465 470 475 480 aat gct ggg tta agg cct ggc ctg agc agt tat att tct ctt ctt aca 1488 Asn Ala Gly Leu Arg Pro Gly Leu Ser Ser Tyr Ile Ser Leu Leu Thr 485 490 495 ctt ctg gcc aac aaa aga ctt gtc gat gtt gct ggg aag ata cta ctc 1536 Leu Leu Ala Asn Lys Arg Leu Val Asp Val Ala Gly Lys Ile Leu Leu 500 505 510 gag atg aaa gca atg ggg tat tcc gta gat 
gtc tgc gct agc gat gtt 1584 Glu Met Lys Ala Met Gly Tyr Ser Val Asp Val Cys Ala Ser Asp Val 515 520 525 ttg atg ata tat atc aaa gat gct tct gtg gat ctt gct ctg aaa tgg 1632 Leu Met Ile Tyr Ile Lys Asp Ala Ser Val Asp Leu Ala Leu Lys Trp 530 535 540 ctt agg ttc atg ggt tct tca ggg atc aaa aca aac aat ttt atc atc 1680 Leu Arg Phe Met Gly Ser Ser Gly Ile Lys Thr Asn Asn Phe Ile Ile 545 550 555 560 agg cag tta ttt gaa tca tgc atg aaa aat ggt cta tac gat tca gct 1728 Arg Gln Leu Phe Glu Ser Cys Met Lys Asn Gly Leu Tyr Asp Ser Ala 565 570 575 agg cct ttg ctg gag aca ctc gtg cat tct gct gga aaa gtt gac ttg 1776 Arg Pro Leu Leu Glu Thr Leu Val His Ser Ala Gly Lys Val Asp Leu 580 585 590 gtg ctt tac act tcg att ctc gcc cat ctt gtc aga tgc caa gac gaa 1824 Val Leu Tyr Thr Ser Ile Leu Ala His Leu Val Arg Cys Gln Asp Glu 595 600 605 gat aaa gag aga caa ttg atg tca atc ctc agc gct acc aag cat aaa 1872 Asp Lys Glu Arg Gln Leu Met Ser Ile Leu Ser Ala Thr Lys His Lys 610 615 620 gct cat gcc ttt atg tgt ggt ctc ttc aca ggt cca gaa cag agg aaa 1920 Ala His Ala Phe Met Cys Gly Leu Phe Thr Gly Pro Glu Gln Arg Lys 625 630 635 640 caa cca gtt cta acg ttt gtc aga gag ttt tac caa ggg att gat tac 1968 Gln Pro Val Leu Thr Phe Val Arg Glu Phe Tyr Gln Gly Ile Asp Tyr 645 650 655 gaa ctc gaa gaa gga gct gct agg tac ttt gtg aat gtc ctt ctc aac 2016 Glu Leu Glu Glu Gly Ala Ala Arg Tyr Phe Val Asn Val Leu Leu Asn 660 665 670 tac ctt gtc ttg atg ggt caa ata aac cga gct cga tgt gtc tgg aaa 2064 Tyr Leu Val Leu Met Gly Gln Ile Asn Arg Ala Arg Cys Val Trp Lys 675 680 685 gta gcg tac gag aac aaa ctc ttt cca aaa gcc atc gtc ttt gat caa 2112 Val Ala Tyr Glu Asn Lys Leu Phe Pro Lys Ala Ile Val Phe Asp Gln 690 695 700 cac att gct tgg tct ctc gat gtg aga aac tta tct gtc gga gct gcg 2160 His Ile Ala Trp Ser Leu Asp Val Arg Asn Leu Ser Val Gly Ala Ala 705 710 715 720 ctc ata gcc gtg gtt cac act ctc cac agg ttc aga aaa cga atg ctt 2208 Leu Ile Ala Val Val His Thr Leu His Arg Phe Arg Lys Arg Met Leu 725 730 
735 tac tac gga gta gtc ccg aga cgt ata aag cta gtc aca gga ccg aca 2256 Tyr Tyr Gly Val Val Pro Arg Arg Ile Lys Leu Val Thr Gly Pro Thr 740 745 750 ttg aag att gta att gct caa atg ctg agc tct gtt gag tcg cct ttt 2304 Leu Lys Ile Val Ile Ala Gln Met Leu Ser Ser Val Glu Ser Pro Phe 755 760 765 gaa gtc agc aaa gtc gtc ttg agg gca cca gga gag ttg gtg atg gag 2352 Glu Val Ser Lys Val Val Leu Arg Ala Pro Gly Glu Leu Val Met Glu 770 775 780 tgg ttc aag aaa ccg atc gtg caa cag ttt ctt ctg aac gag atc cca 2400 Trp Phe Lys Lys Pro Ile Val Gln Gln Phe Leu Leu Asn Glu Ile Pro 785 790 795 800 tca cgg tct gat ata ttg atg cat aag atg aac gtg atg ttc cca agc 2448 Ser Arg Ser Asp Ile Leu Met His Lys Met Asn Val Met Phe Pro Ser 805 810 815 tcg gct cct gag ctt aga tct atg tca cct ccc aaa cca ctc atg tca 2496 Ser Ala Pro Glu Leu Arg Ser Met Ser Pro Pro Lys Pro Leu Met Ser 820 825 830 tcg aaa gcg ttc taa 2511 Ser Lys Ala Phe 835 22 836 PRT Arabidopsis thaliana 22 Met Ile Arg Gly Arg Thr Ala Lys Val Ile Pro Arg Asn Val Ile Phe 1 5 10 15 Thr Leu Ser Arg Ser Ser Ile Ser Glu Ser Pro Leu Ile Ser Pro Ser 20 25 30 Arg Ile Asn Pro Lys Leu Ala Gly Ser Phe Ser Phe Asn Ile Arg Leu 35 40 45 Leu Ser Tyr Phe Thr Val Arg Asn Gly Phe Cys Pro Asp Cys Ser Val 50 55 60 Pro Arg Asp Pro Asn Phe Val Gly Leu Thr Thr Gln Cys Arg Ser Ile 65 70 75 80 Val Arg Arg Phe Cys Ser Glu Lys Ile Gly Ser Ser Glu Ser Ser Gly 85 90 95 Trp Thr Glu Glu Val Glu Tyr Leu Asp Glu Ser Gly Ser Val Leu His 100 105 110 Ser Gly Lys Gly Ile Arg Ser Val Glu Pro Gly Leu Asp Asp His Val 115 120 125 Met Val Gly Gly Leu Lys Lys Pro Tyr Met Asn Ala Ser Ser Val Ala 130 135 140 Lys Ile Val Glu Val Val Gln Arg Trp Lys Trp Gly Pro Glu Leu Glu 145 150 155 160 Thr Gln Leu Asp Lys Leu Gln Phe Val Pro Asn Met Val His Ile Thr 165 170 175 Gln Ser Leu Lys Ile Val Lys Glu Val Asp Ala Ala Leu Ser Leu Phe 180 185 190 Arg Trp Ala Lys Lys Gln Pro Trp Tyr Leu Pro Ser Asp Glu Cys Tyr 195 200 205 Val Val Leu Phe Asp Gly Leu Asn Gln Gly Arg Asp Phe Val Gly Ile 
210 215 220 Gln Ser Leu Phe Glu Glu Met Val Gln Asp Ser Ser Ser His Gly Asp 225 230 235 240 Leu Ser Phe Asn Ala Tyr Asn Gln Val Ile Gln Tyr Leu Ala Lys Ala 245 250 255 Glu Lys Leu Glu Val Ala Phe Cys Cys Phe Lys Lys Ala Gln Glu Ser 260 265 270 Gly Cys Lys Ile Asp Thr Gln Thr Tyr Asn Asn Leu Met Met Leu Phe 275 280 285 Leu Asn Lys Gly Leu Pro Tyr Lys Ala Phe Glu Ile Tyr Glu Ser Met 290 295 300 Glu Lys Thr Asp Ser Leu Leu Asp Gly Ser Thr Tyr Glu Leu Ile Ile 305 310 315 320 Pro Ser Leu Ala Lys Ser Gly Arg Leu Asp Ala Ala Phe Lys Leu Phe 325 330 335 Gln Gln Met Lys Glu Arg Lys Leu Arg Pro Ser Phe Ser Val Phe Ser 340 345 350 Ser Leu Val Asp Ser Met Gly Lys Ala Gly Arg Leu Asp Thr Ser Met 355 360 365 Lys Val Tyr Met Glu Met Gln Gly Phe Gly His Arg Pro Ser Ala Thr 370 375 380 Met Phe Val Ser Leu Ile Asp Ser Tyr Ala Lys Ala Gly Lys Leu Asp 385 390 395 400 Thr Ala Leu Arg Leu Trp Asp Glu Met Lys Lys Ser Gly Phe Arg Pro 405 410 415 Asn Phe Gly Leu Tyr Thr Met Ile Ile Glu Ser His Ala Lys Ser Gly 420 425 430 Lys Leu Glu Val Ala Met Thr Val Phe Lys Asp Met Glu Lys Ala Gly 435 440 445 Phe Leu Pro Thr Pro Ser Thr Tyr Ser Cys Leu Leu Glu Met His Ala 450 455 460 Gly Ser Gly Gln Val Asp Ser Ala Met Lys Ile Tyr Asn Ser Met Thr 465 470 475 480 Asn Ala Gly Leu Arg Pro Gly Leu Ser Ser Tyr Ile Ser Leu Leu Thr 485 490 495 Leu Leu Ala Asn Lys Arg Leu Val Asp Val Ala Gly Lys Ile Leu Leu 500 505 510 Glu Met Lys Ala Met Gly Tyr Ser Val Asp Val Cys Ala Ser Asp Val 515 520 525 Leu Met Ile Tyr Ile Lys Asp Ala Ser Val Asp Leu Ala Leu Lys Trp 530 535 540 Leu Arg Phe Met Gly Ser Ser Gly Ile Lys Thr Asn Asn Phe Ile Ile 545 550 555 560 Arg Gln Leu Phe Glu Ser Cys Met Lys Asn Gly Leu Tyr Asp Ser Ala 565 570 575 Arg Pro Leu Leu Glu Thr Leu Val His Ser Ala Gly Lys Val Asp Leu 580 585 590 Val Leu Tyr Thr Ser Ile Leu Ala His Leu Val Arg Cys Gln Asp Glu 595 600 605 Asp Lys Glu Arg Gln Leu Met Ser Ile Leu Ser Ala Thr Lys His Lys 610 615 620 Ala His Ala Phe Met Cys Gly Leu Phe Thr Gly Pro Glu Gln Arg Lys 625 
630 635 640 Gln Pro Val Leu Thr Phe Val Arg Glu Phe Tyr Gln Gly Ile Asp Tyr 645 650 655 Glu Leu Glu Glu Gly Ala Ala Arg Tyr Phe Val Asn Val Leu Leu Asn 660 665 670 Tyr Leu Val Leu Met Gly Gln Ile Asn Arg Ala Arg Cys Val Trp Lys 675 680 685 Val Ala Tyr Glu Asn Lys Leu Phe Pro Lys Ala Ile Val Phe Asp Gln 690 695 700 His Ile Ala Trp Ser Leu Asp Val Arg Asn Leu Ser Val Gly Ala Ala 705 710 715 720 Leu Ile Ala Val Val His Thr Leu His Arg Phe Arg Lys Arg Met Leu 725 730 735 Tyr Tyr Gly Val Val Pro Arg Arg Ile Lys Leu Val Thr Gly Pro Thr 740 745 750 Leu Lys Ile Val Ile Ala Gln Met Leu Ser Ser Val Glu Ser Pro Phe 755 760 765 Glu Val Ser Lys Val Val Leu Arg Ala Pro Gly Glu Leu Val Met Glu 770 775 780 Trp Phe Lys Lys Pro Ile Val Gln Gln Phe Leu Leu Asn Glu Ile Pro 785 790 795 800 Ser Arg Ser Asp Ile Leu Met His Lys Met Asn Val Met Phe Pro Ser 805 810 815 Ser Ala Pro Glu Leu Arg Ser Met Ser Pro Pro Lys Pro Leu Met Ser 820 825 830 Ser Lys Ala Phe 835 23 2922 DNA Arabidopsis thaliana CDS (1)..(2922) 23 atg agc tct cat cat caa atc ctc caa att cgt tct gac cca ttt gtt 48 Met Ser Ser His His Gln Ile Leu Gln Ile Arg Ser Asp Pro Phe Val 1 5 10 15 cta tca cac tgt tgc cgc cac aca cgc ctc acc agc tct ctt acg ctt 96 Leu Ser His Cys Cys Arg His Thr Arg Leu Thr Ser Ser Leu Thr Leu 20 25 30 caa tct cct cta aag cag ccc ttt tca tgt ctc cct ttc cgt tgg cgc 144 Gln Ser Pro Leu Lys Gln Pro Phe Ser Cys Leu Pro Phe Arg Trp Arg 35 40 45 cgc tct tat cgc ggc gga gtc agg agc tca acg acg gag act cat ggt 192 Arg Ser Tyr Arg Gly Gly Val Arg Ser Ser Thr Thr Glu Thr His Gly 50 55 60 agc aaa aaa gaa gct ttg gtg tcg gaa acg gcg act aca tcg att gaa 240 Ser Lys Lys Glu Ala Leu Val Ser Glu Thr Ala Thr Thr Ser Ile Glu 65 70 75 80 ttg aaa agg gtc tat cca ttt cac gaa att gaa ccc aag tgg caa cgt 288 Leu Lys Arg Val Tyr Pro Phe His Glu Ile Glu Pro Lys Trp Gln Arg 85 90 95 tat tgg gaa gat aac cga att ttc cgg aca cca gac gat gtt gat act 336 Tyr Trp Glu Asp Asn Arg Ile Phe Arg Thr Pro Asp Asp Val Asp Thr 100 105 110 tcg 
aaa cct aag ttc tac gtc ctc gac atg ttt cca tat cca agt gga 384 Ser Lys Pro Lys Phe Tyr Val Leu Asp Met Phe Pro Tyr Pro Ser Gly 115 120 125 gca gga tta cat gtg gga cat cca ttg ggt tat aca gca act gat att 432 Ala Gly Leu His Val Gly His Pro Leu Gly Tyr Thr Ala Thr Asp Ile 130 135 140 cta gcg agg ctt agg cgt atg caa ggt tac aat gtg ttg cat cct atg 480 Leu Ala Arg Leu Arg Arg Met Gln Gly Tyr Asn Val Leu His Pro Met 145 150 155 160 ggt tgg gat gct ttt gga ttg cct gct gaa caa tat gct att gag aca 528 Gly Trp Asp Ala Phe Gly Leu Pro Ala Glu Gln Tyr Ala Ile Glu Thr 165 170 175 gga acc cat ccg aaa act acg aca ttg aag aac att gac cgc ttt cga 576 Gly Thr His Pro Lys Thr Thr Thr Leu Lys Asn Ile Asp Arg Phe Arg 180 185 190 ttg cag ctc aaa tca ttg ggc ttc tca tac gat tgg gac cgt gaa ctt 624 Leu Gln Leu Lys Ser Leu Gly Phe Ser Tyr Asp Trp Asp Arg Glu Leu 195 200 205 tct aca aca gag cca gac tac tac aaa tgg aca caa tgg atc ttt ctt 672 Ser Thr Thr Glu Pro Asp Tyr Tyr Lys Trp Thr Gln Trp Ile Phe Leu 210 215 220 cag ctt tat aag aaa ggt ttg gca tat cag gct gaa gta ccg gtc aat 720 Gln Leu Tyr Lys Lys Gly Leu Ala Tyr Gln Ala Glu Val Pro Val Asn 225 230 235 240 tgg tgc ccg gct ctt ggt act gtt ttg gcc aat gaa gaa gtg gtg gat 768 Trp Cys Pro Ala Leu Gly Thr Val Leu Ala Asn Glu Glu Val Val Asp 245 250 255 ggt gtt agt gag cgt ggt ggc cac ccg gtt ata aga aag ccg atg agg 816 Gly Val Ser Glu Arg Gly Gly His Pro Val Ile Arg Lys Pro Met Arg 260 265 270 caa tgg atg ctg aag att act gcg tac gct gat cgt ctt cta gaa gat 864 Gln Trp Met Leu Lys Ile Thr Ala Tyr Ala Asp Arg Leu Leu Glu Asp 275 280 285 ttg gac gag ctt gag tgg cct gaa agt ata aag gaa atg caa aga aac 912 Leu Asp Glu Leu Glu Trp Pro Glu Ser Ile Lys Glu Met Gln Arg Asn 290 295 300 tgg ata gga aga tcc gaa gga gct gag ttg aac ttt tca att ctt gat 960 Trp Ile Gly Arg Ser Glu Gly Ala Glu Leu Asn Phe Ser Ile Leu Asp 305 310 315 320 ggg gaa ggc cga gaa act gac aaa gag att aca gtg tac acc acg agg 1008 Gly Glu Gly Arg Glu Thr Asp Lys Glu Ile Thr Val 
Tyr Thr Thr Arg 325 330 335 cca gat aca ctt ttt gga gca act tac atg gtt gtg gca cca gag cat 1056 Pro Asp Thr Leu Phe Gly Ala Thr Tyr Met Val Val Ala Pro Glu His 340 345 350 cag ttg ttg tct tac ttt gta acc gca gaa cag aaa cag caa gtt gag 1104 Gln Leu Leu Ser Tyr Phe Val Thr Ala Glu Gln Lys Gln Gln Val Glu 355 360 365 gaa tac aaa gat ttc gct tcg agg aaa agt gat ctt gag aga aca gaa 1152 Glu Tyr Lys Asp Phe Ala Ser Arg Lys Ser Asp Leu Glu Arg Thr Glu 370 375 380 ctt cag aag gac aag acg ggc gta ttt acg gga tgt tat gcg aaa aat 1200 Leu Gln Lys Asp Lys Thr Gly Val Phe Thr Gly Cys Tyr Ala Lys Asn 385 390 395 400 cca gct aat gga gat gct att ccc ata tgg gta gct gat tat gtt cta 1248 Pro Ala Asn Gly Asp Ala Ile Pro Ile Trp Val Ala Asp Tyr Val Leu 405 410 415 gct agt tat gga acc gga gca atc atg gcg gtc cca gct cat gat act 1296 Ala Ser Tyr Gly Thr Gly Ala Ile Met Ala Val Pro Ala His Asp Thr 420 425 430 cgg gat aat gag ttt gcg ttg aag tat aac att ccc atc aag tgg gta 1344 Arg Asp Asn Glu Phe Ala Leu Lys Tyr Asn Ile Pro Ile Lys Trp Val 435 440 445 gtg aga aat gag gca aat tca agt gat gat gct aag cag gtt tat ccc 1392 Val Arg Asn Glu Ala Asn Ser Ser Asp Asp Ala Lys Gln Val Tyr Pro 450 455 460 gga ttg ggc att att gaa aat tct tca act tta gaa aca gga ctt gac 1440 Gly Leu Gly Ile Ile Glu Asn Ser Ser Thr Leu Glu Thr Gly Leu Asp 465 470 475 480 ata aat caa cta tct agc aaa gaa gct gct tta aaa gtt att gaa tgg 1488 Ile Asn Gln Leu Ser Ser Lys Glu Ala Ala Leu Lys Val Ile Glu Trp 485 490 495 gct gag aga act ggc aat gga aag aaa aag gta aat tac aag ttg agg 1536 Ala Glu Arg Thr Gly Asn Gly Lys Lys Lys Val Asn Tyr Lys Leu Arg 500 505 510 gat tgg cta ttt gca cgg cag cgt tac tgg ggt gaa cct atc ccg att 1584 Asp Trp Leu Phe Ala Arg Gln Arg Tyr Trp Gly Glu Pro Ile Pro Ile 515 520 525 tta att ttg gat gag tct ggt gaa act att gct att tca gaa tct gaa 1632 Leu Ile Leu Asp Glu Ser Gly Glu Thr Ile Ala Ile Ser Glu Ser Glu 530 535 540 ctg ccc ctc act ttg cct gag tta aat gat ttt acg ccc act gga aca 1680 Leu Pro 
Leu Thr Leu Pro Glu Leu Asn Asp Phe Thr Pro Thr Gly Thr 545 550 555 560 ggg gaa cca cca ctg tca aaa gca gtt tca tgg gtg aac act gta gat 1728 Gly Glu Pro Pro Leu Ser Lys Ala Val Ser Trp Val Asn Thr Val Asp 565 570 575 cct tca aca gga aaa cct gcc aaa aga gaa aca agc act atg cca caa 1776 Pro Ser Thr Gly Lys Pro Ala Lys Arg Glu Thr Ser Thr Met Pro Gln 580 585 590 tgg gct ggg tct tgc tgg tac tat ctg aga ttc atg gac ccg aaa aac 1824 Trp Ala Gly Ser Cys Trp Tyr Tyr Leu Arg Phe Met Asp Pro Lys Asn 595 600 605 cct gaa gca tta gtt gac aag gaa aaa gaa aaa tac tgg agc cca gta 1872 Pro Glu Ala Leu Val Asp Lys Glu Lys Glu Lys Tyr Trp Ser Pro Val 610 615 620 gat gta tat gtt ggt ggt gct gag cat gct gtt tta cat tta cta tac 1920 Asp Val Tyr Val Gly Gly Ala Glu His Ala Val Leu His Leu Leu Tyr 625 630 635 640 tcg agg ttt tgg cac aag gta ctt tat gac att ggt gtt gtg tct acg 1968 Ser Arg Phe Trp His Lys Val Leu Tyr Asp Ile Gly Val Val Ser Thr 645 650 655 aaa gaa cca ttc aaa tgt gtc ata aat caa gga atc atc ctt ggg gaa 2016 Lys Glu Pro Phe Lys Cys Val Ile Asn Gln Gly Ile Ile Leu Gly Glu 660 665 670 gtc caa tat aca gct tgg aaa gat caa gaa gga aat tat gta tct gca 2064 Val Gln Tyr Thr Ala Trp Lys Asp Gln Glu Gly Asn Tyr Val Ser Ala 675 680 685 gat act gaa gaa cgg tta aat gaa cat caa caa gtg aca atc ccg gaa 2112 Asp Thr Glu Glu Arg Leu Asn Glu His Gln Gln Val Thr Ile Pro Glu 690 695 700 gaa aaa gtt atc aag tcc gga gat cat ttt gtt ctg aag gag gat cct 2160 Glu Lys Val Ile Lys Ser Gly Asp His Phe Val Leu Lys Glu Asp Pro 705 710 715 720 agc att cgt ctg att cca cgc gtt tat aaa atg agc aaa agt aga gga 2208 Ser Ile Arg Leu Ile Pro Arg Val Tyr Lys Met Ser Lys Ser Arg Gly 725 730 735 aat gtc gta aat cct gat gat gtt gta tta gag tat ggt gca gat tct 2256 Asn Val Val Asn Pro Asp Asp Val Val Leu Glu Tyr Gly Ala Asp Ser 740 745 750 cta cgt ttg tat gag atg ttc atg gga cca ttt agg gac tca aaa act 2304 Leu Arg Leu Tyr Glu Met Phe Met Gly Pro Phe Arg Asp Ser Lys Thr 755 760 765 tgg aac acg agt ggg att gaa ggt 
gtg cat cgt ttc ttg gca aga act 2352 Trp Asn Thr Ser Gly Ile Glu Gly Val His Arg Phe Leu Ala Arg Thr 770 775 780 tgg aga cta gtc att ggt ttg cct cag tct gat ggt tct ttc aaa gat 2400 Trp Arg Leu Val Ile Gly Leu Pro Gln Ser Asp Gly Ser Phe Lys Asp 785 790 795 800 gga acc tta gta act gat gac gaa cca acc ctt gaa caa ctc cga act 2448 Gly Thr Leu Val Thr Asp Asp Glu Pro Thr Leu Glu Gln Leu Arg Thr 805 810 815 ctt cat aaa tgc att gca aag gtg aca gag gaa atc gag agt aca cga 2496 Leu His Lys Cys Ile Ala Lys Val Thr Glu Glu Ile Glu Ser Thr Arg 820 825 830 ttc aac acg gga att tca ggg atg atg gag ttt gtt aat gcg gca tat 2544 Phe Asn Thr Gly Ile Ser Gly Met Met Glu Phe Val Asn Ala Ala Tyr 835 840 845 aag tgg aat aat caa cct aga gga atc att gag cct ttt gtt ctt ttg 2592 Lys Trp Asn Asn Gln Pro Arg Gly Ile Ile Glu Pro Phe Val Leu Leu 850 855 860 ctc tca cct tac gcg cct cac atg gct gaa gag ctt tgg tcg cgg tta 2640 Leu Ser Pro Tyr Ala Pro His Met Ala Glu Glu Leu Trp Ser Arg Leu 865 870 875 880 ggc cat cca aat tct ctg gcc tat gaa agc ttc cct aag gca aac ccg 2688 Gly His Pro Asn Ser Leu Ala Tyr Glu Ser Phe Pro Lys Ala Asn Pro 885 890 895 gat tac tta aag aac aca act att gtt ctt cca gtt cag ata aac ggc 2736 Asp Tyr Leu Lys Asn Thr Thr Ile Val Leu Pro Val Gln Ile Asn Gly 900 905 910 aag aca aga ggc acc ata gag gtt gaa gaa gga tgc tct gaa gat gat 2784 Lys Thr Arg Gly Thr Ile Glu Val Glu Glu Gly Cys Ser Glu Asp Asp 915 920 925 gct ttt gtt cta gct tca cag gac gac aaa ctc agg aaa tat cta gat 2832 Ala Phe Val Leu Ala Ser Gln Asp Asp Lys Leu Arg Lys Tyr Leu Asp 930 935 940 gga caa tca atc aag aag cga atc tat gtg cct ggg aag atc ctc aac 2880 Gly Gln Ser Ile Lys Lys Arg Ile Tyr Val Pro Gly Lys Ile Leu Asn 945 950 955 960 gtt atc cta gac cgg act aat gtc aag gtc act aca aag tag 2922 Val Ile Leu Asp Arg Thr Asn Val Lys Val Thr Thr Lys 965 970 24 973 PRT Arabidopsis thaliana 24 Met Ser Ser His His Gln Ile Leu Gln Ile Arg Ser Asp Pro Phe Val 1 5 10 15 Leu Ser His Cys Cys Arg His Thr Arg Leu Thr Ser 
Ser Leu Thr Leu 20 25 30 Gln Ser Pro Leu Lys Gln Pro Phe Ser Cys Leu Pro Phe Arg Trp Arg 35 40 45 Arg Ser Tyr Arg Gly Gly Val Arg Ser Ser Thr Thr Glu Thr His Gly 50 55 60 Ser Lys Lys Glu Ala Leu Val Ser Glu Thr Ala Thr Thr Ser Ile Glu 65 70 75 80 Leu Lys Arg Val Tyr Pro Phe His Glu Ile Glu Pro Lys Trp Gln Arg 85 90 95 Tyr Trp Glu Asp Asn Arg Ile Phe Arg Thr Pro Asp Asp Val Asp Thr 100 105 110 Ser Lys Pro Lys Phe Tyr Val Leu Asp Met Phe Pro Tyr Pro Ser Gly 115 120 125 Ala Gly Leu His Val Gly His Pro Leu Gly Tyr Thr Ala Thr Asp Ile 130 135 140 Leu Ala Arg Leu Arg Arg Met Gln Gly Tyr Asn Val Leu His Pro Met 145 150 155 160 Gly Trp Asp Ala Phe Gly Leu Pro Ala Glu Gln Tyr Ala Ile Glu Thr 165 170 175 Gly Thr His Pro Lys Thr Thr Thr Leu Lys Asn Ile Asp Arg Phe Arg 180 185 190 Leu Gln Leu Lys Ser Leu Gly Phe Ser Tyr Asp Trp Asp Arg Glu Leu 195 200 205 Ser Thr Thr Glu Pro Asp Tyr Tyr Lys Trp Thr Gln Trp Ile Phe Leu 210 215 220 Gln Leu Tyr Lys Lys Gly Leu Ala Tyr Gln Ala Glu Val Pro Val Asn 225 230 235 240 Trp Cys Pro Ala Leu Gly Thr Val Leu Ala Asn Glu Glu Val Val Asp 245 250 255 Gly Val Ser Glu Arg Gly Gly His Pro Val Ile Arg Lys Pro Met Arg 260 265 270 Gln Trp Met Leu Lys Ile Thr Ala Tyr Ala Asp Arg Leu Leu Glu Asp 275 280 285 Leu Asp Glu Leu Glu Trp Pro Glu Ser Ile Lys Glu Met Gln Arg Asn 290 295 300 Trp Ile Gly Arg Ser Glu Gly Ala Glu Leu Asn Phe Ser Ile Leu Asp 305 310 315 320 Gly Glu Gly Arg Glu Thr Asp Lys Glu Ile Thr Val Tyr Thr Thr Arg 325 330 335 Pro Asp Thr Leu Phe Gly Ala Thr Tyr Met Val Val Ala Pro Glu His 340 345 350 Gln Leu Leu Ser Tyr Phe Val Thr Ala Glu Gln Lys Gln Gln Val Glu 355 360 365 Glu Tyr Lys Asp Phe Ala Ser Arg Lys Ser Asp Leu Glu Arg Thr Glu 370 375 380 Leu Gln Lys Asp Lys Thr Gly Val Phe Thr Gly Cys Tyr Ala Lys Asn 385 390 395 400 Pro Ala Asn Gly Asp Ala Ile Pro Ile Trp Val Ala Asp Tyr Val Leu 405 410 415 Ala Ser Tyr Gly Thr Gly Ala Ile Met Ala Val Pro Ala His Asp Thr 420 425 430 Arg Asp Asn Glu Phe Ala Leu Lys Tyr Asn Ile Pro Ile Lys Trp Val 435 
440 445 Val Arg Asn Glu Ala Asn Ser Ser Asp Asp Ala Lys Gln Val Tyr Pro 450 455 460 Gly Leu Gly Ile Ile Glu Asn Ser Ser Thr Leu Glu Thr Gly Leu Asp 465 470 475 480 Ile Asn Gln Leu Ser Ser Lys Glu Ala Ala Leu Lys Val Ile Glu Trp 485 490 495 Ala Glu Arg Thr Gly Asn Gly Lys Lys Lys Val Asn Tyr Lys Leu Arg 500 505 510 Asp Trp Leu Phe Ala Arg Gln Arg Tyr Trp Gly Glu Pro Ile Pro Ile 515 520 525 Leu Ile Leu Asp Glu Ser Gly Glu Thr Ile Ala Ile Ser Glu Ser Glu 530 535 540 Leu Pro Leu Thr Leu Pro Glu Leu Asn Asp Phe Thr Pro Thr Gly Thr 545 550 555 560 Gly Glu Pro Pro Leu Ser Lys Ala Val Ser Trp Val Asn Thr Val Asp 565 570 575 Pro Ser Thr Gly Lys Pro Ala Lys Arg Glu Thr Ser Thr Met Pro Gln 580 585 590 Trp Ala Gly Ser Cys Trp Tyr Tyr Leu Arg Phe Met Asp Pro Lys Asn 595 600 605 Pro Glu Ala Leu Val Asp Lys Glu Lys Glu Lys Tyr Trp Ser Pro Val 610 615 620 Asp Val Tyr Val Gly Gly Ala Glu His Ala Val Leu His Leu Leu Tyr 625 630 635 640 Ser Arg Phe Trp His Lys Val Leu Tyr Asp Ile Gly Val Val Ser Thr 645 650 655 Lys Glu Pro Phe Lys Cys Val Ile Asn Gln Gly Ile Ile Leu Gly Glu 660 665 670 Val Gln Tyr Thr Ala Trp Lys Asp Gln Glu Gly Asn Tyr Val Ser Ala 675 680 685 Asp Thr Glu Glu Arg Leu Asn Glu His Gln Gln Val Thr Ile Pro Glu 690 695 700 Glu Lys Val Ile Lys Ser Gly Asp His Phe Val Leu Lys Glu Asp Pro 705 710 715 720 Ser Ile Arg Leu Ile Pro Arg Val Tyr Lys Met Ser Lys Ser Arg Gly 725 730 735 Asn Val Val Asn Pro Asp Asp Val Val Leu Glu Tyr Gly Ala Asp Ser 740 745 750 Leu Arg Leu Tyr Glu Met Phe Met Gly Pro Phe Arg Asp Ser Lys Thr 755 760 765 Trp Asn Thr Ser Gly Ile Glu Gly Val His Arg Phe Leu Ala Arg Thr 770 775 780 Trp Arg Leu Val Ile Gly Leu Pro Gln Ser Asp Gly Ser Phe Lys Asp 785 790 795 800 Gly Thr Leu Val Thr Asp Asp Glu Pro Thr Leu Glu Gln Leu Arg Thr 805 810 815 Leu His Lys Cys Ile Ala Lys Val Thr Glu Glu Ile Glu Ser Thr Arg 820 825 830 Phe Asn Thr Gly Ile Ser Gly Met Met Glu Phe Val Asn Ala Ala Tyr 835 840 845 Lys Trp Asn Asn Gln Pro Arg Gly Ile Ile Glu Pro Phe Val Leu Leu 850 855 
860 Leu Ser Pro Tyr Ala Pro His Met Ala Glu Glu Leu Trp Ser Arg Leu 865 870 875 880 Gly His Pro Asn Ser Leu Ala Tyr Glu Ser Phe Pro Lys Ala Asn Pro 885 890 895 Asp Tyr Leu Lys Asn Thr Thr Ile Val Leu Pro Val Gln Ile Asn Gly 900 905 910 Lys Thr Arg Gly Thr Ile Glu Val Glu Glu Gly Cys Ser Glu Asp Asp 915 920 925 Ala Phe Val Leu Ala Ser Gln Asp Asp Lys Leu Arg Lys Tyr Leu Asp 930 935 940 Gly Gln Ser Ile Lys Lys Arg Ile Tyr Val Pro Gly Lys Ile Leu Asn 945 950 955 960 Val Ile Leu Asp Arg Thr Asn Val Lys Val Thr Thr Lys 965 970 25 774 DNA Arabidopsis thaliana CDS (1)..(774) 25 atg tcg atg tcg aag agt tca aag atg ctt cag ttc atc aac tac agg 48 Met Ser Met Ser Lys Ser Ser Lys Met Leu Gln Phe Ile Asn Tyr Arg 1 5 10 15 atg cga gtg acg atc caa gac gga aga cag ctc gtt ggg aag ttc atg 96 Met Arg Val Thr Ile Gln Asp Gly Arg Gln Leu Val Gly Lys Phe Met 20 25 30 gcg ttt gac cgt cac atg aac ctc gtt ctc ggc gat tgc gag gag ttt 144 Ala Phe Asp Arg His Met Asn Leu Val Leu Gly Asp Cys Glu Glu Phe 35 40 45 cgt aag ctt cca cca gct aaa ggg aag aag atc aac gaa gag cgt gaa 192 Arg Lys Leu Pro Pro Ala Lys Gly Lys Lys Ile Asn Glu Glu Arg Glu 50 55 60 gat cgc cgt acg cta ggt tta gtg ttg ctt aga ggt gaa gaa gtt atc 240 Asp Arg Arg Thr Leu Gly Leu Val Leu Leu Arg Gly Glu Glu Val Ile 65 70 75 80 tca atg act gtc gaa gga cca cct cct cct gaa gaa tct cgt gct aaa 288 Ser Met Thr Val Glu Gly Pro Pro Pro Pro Glu Glu Ser Arg Ala Lys 85 90 95 gct ggc tct gca gct gct gtt gct ggt cca gga att ggc cgt gcc gct 336 Ala Gly Ser Ala Ala Ala Val Ala Gly Pro Gly Ile Gly Arg Ala Ala 100 105 110 ggg cgt gga gtt cct act ggt ccg tta gtt caa gct cag cct ggt tta 384 Gly Arg Gly Val Pro Thr Gly Pro Leu Val Gln Ala Gln Pro Gly Leu 115 120 125 tct ggt cct gtt cgt ggt gtt ggt gga cct gct ccg gga atg atg cag 432 Ser Gly Pro Val Arg Gly Val Gly Gly Pro Ala Pro Gly Met Met Gln 130 135 140 cct cag atc tct cgt cca cca cag ctt tca gct cct ccg att att cga 480 Pro Gln Ile Ser Arg Pro Pro Gln Leu Ser Ala Pro Pro Ile Ile Arg 145 150 
155 160 cct ccg gga cag atg ttg cca ccg cct cct ttt ggt ggt caa ggt cct 528 Pro Pro Gly Gln Met Leu Pro Pro Pro Pro Phe Gly Gly Gln Gly Pro 165 170 175 cct atg gga aga ggt cct cca ccg cct tat ggt atg agg cca ccg cca 576 Pro Met Gly Arg Gly Pro Pro Pro Pro Tyr Gly Met Arg Pro Pro Pro 180 185 190 cag cag ttt tct gga cca cca ccg cct cag tat ggg caa agg cca atg 624 Gln Gln Phe Ser Gly Pro Pro Pro Pro Gln Tyr Gly Gln Arg Pro Met 195 200 205 att cct cct cct ggt ggt atg atg aga gga cct cct cct cca cct cat 672 Ile Pro Pro Pro Gly Gly Met Met Arg Gly Pro Pro Pro Pro Pro His 210 215 220 gga atg caa gga cct cct ccg cct cgc cct gga atg cct cct gct cct 720 Gly Met Gln Gly Pro Pro Pro Pro Arg Pro Gly Met Pro Pro Ala Pro 225 230 235 240 ggt ggt ttt gct cca ccg cgt cct ggc atg cca cca cat aat cag cag 768 Gly Gly Phe Ala Pro Pro Arg Pro Gly Met Pro Pro His Asn Gln Gln 245 250 255 cag tga 774 Gln 26 257 PRT Arabidopsis thaliana 26 Met Ser Met Ser Lys Ser Ser Lys Met Leu Gln Phe Ile Asn Tyr Arg 1 5 10 15 Met Arg Val Thr Ile Gln Asp Gly Arg Gln Leu Val Gly Lys Phe Met 20 25 30 Ala Phe Asp Arg His Met Asn Leu Val Leu Gly Asp Cys Glu Glu Phe 35 40 45 Arg Lys Leu Pro Pro Ala Lys Gly Lys Lys Ile Asn Glu Glu Arg Glu 50 55 60 Asp Arg Arg Thr Leu Gly Leu Val Leu Leu Arg Gly Glu Glu Val Ile 65 70 75 80 Ser Met Thr Val Glu Gly Pro Pro Pro Pro Glu Glu Ser Arg Ala Lys 85 90 95 Ala Gly Ser Ala Ala Ala Val Ala Gly Pro Gly Ile Gly Arg Ala Ala 100 105 110 Gly Arg Gly Val Pro Thr Gly Pro Leu Val Gln Ala Gln Pro Gly Leu 115 120 125 Ser Gly Pro Val Arg Gly Val Gly Gly Pro Ala Pro Gly Met Met Gln 130 135 140 Pro Gln Ile Ser Arg Pro Pro Gln Leu Ser Ala Pro Pro Ile Ile Arg 145 150 155 160 Pro Pro Gly Gln Met Leu Pro Pro Pro Pro Phe Gly Gly Gln Gly Pro 165 170 175 Pro Met Gly Arg Gly Pro Pro Pro Pro Tyr Gly Met Arg Pro Pro Pro 180 185 190 Gln Gln Phe Ser Gly Pro Pro Pro Pro Gln Tyr Gly Gln Arg Pro Met 195 200 205 Ile Pro Pro Pro Gly Gly Met Met Arg Gly Pro Pro Pro Pro Pro His 210 215 220 Gly Met Gln Gly 
Pro Pro Pro Pro Arg Pro Gly Met Pro Pro Ala Pro 225 230 235 240 Gly Gly Phe Ala Pro Pro Arg Pro Gly Met Pro Pro His Asn Gln Gln 245 250 255 Gln 27 1311 DNA Arabidopsis thaliana CDS (1)..(1311) 27 atg gcg tcg tct tct ctc act tcg aaa tcc att ctc gga tcc acc aaa 48 Met Ala Ser Ser Ser Leu Thr Ser Lys Ser Ile Leu Gly Ser Thr Lys 1 5 10 15 ctc ggt tct tct tct ctt ccc tcg gag ctc cgt cgt ctc tct tct ccc 96 Leu Gly Ser Ser Ser Leu Pro Ser Glu Leu Arg Arg Leu Ser Ser Pro 20 25 30 gcc gtt cag atc tct ctc cgt acc caa acc agg aag aac ttc cag ata 144 Ala Val Gln Ile Ser Leu Arg Thr Gln Thr Arg Lys Asn Phe Gln Ile 35 40 45 caa gct act gga agt tca tat ggg act cat ttt cga gtt tca act ttt 192 Gln Ala Thr Gly Ser Ser Tyr Gly Thr His Phe Arg Val Ser Thr Phe 50 55 60 gga gaa tca cat gga gga gga gtt ggt tgt atc att gat ggt tgt cct 240 Gly Glu Ser His Gly Gly Gly Val Gly Cys Ile Ile Asp Gly Cys Pro 65 70 75 80 cct cgt att cca ctt act gaa tct gat ttg caa ttc gat ctc gat aga 288 Pro Arg Ile Pro Leu Thr Glu Ser Asp Leu Gln Phe Asp Leu Asp Arg 85 90 95 agg agg cct ggt cag agc agg atc aca act cct aga aaa gag act gat 336 Arg Arg Pro Gly Gln Ser Arg Ile Thr Thr Pro Arg Lys Glu Thr Asp 100 105 110 act tgc cgg ata tcg tct gga gtc tct gaa gga atg acg aca gga aca 384 Thr Cys Arg Ile Ser Ser Gly Val Ser Glu Gly Met Thr Thr Gly Thr 115 120 125 cct atc cat gtg ttt gta cca aac aca gat cag aga gga ctt gat tac 432 Pro Ile His Val Phe Val Pro Asn Thr Asp Gln Arg Gly Leu Asp Tyr 130 135 140 agt gaa atg tcg gtt gcc tat aga cca tcg cat gct gat gca act tat 480 Ser Glu Met Ser Val Ala Tyr Arg Pro Ser His Ala Asp Ala Thr Tyr 145 150 155 160 gac atg aag tat ggt gtc aga tca gtg cag ggt gga gga aga tct tca 528 Asp Met Lys Tyr Gly Val Arg Ser Val Gln Gly Gly Gly Arg Ser Ser 165 170 175 gct aga gag acc att gga aga gtt gct cct gga gct ttg gcc aag aaa 576 Ala Arg Glu Thr Ile Gly Arg Val Ala Pro Gly Ala Leu Ala Lys Lys 180 185 190 att ttg aag caa ttt gca gga act gag att ctt gcc tat gtc tcg caa 624 Ile Leu Lys Gln Phe 
Ala Gly Thr Glu Ile Leu Ala Tyr Val Ser Gln 195 200 205 gtt cac cat gtt gta ctt cca gaa gaa ttg gta gac cac gag aat tta 672 Val His His Val Val Leu Pro Glu Glu Leu Val Asp His Glu Asn Leu 210 215 220 aca ctc gaa cag ata gaa aat aac att gtc aga tgc cct aat ccc gag 720 Thr Leu Glu Gln Ile Glu Asn Asn Ile Val Arg Cys Pro Asn Pro Glu 225 230 235 240 tat gcg gaa aag atg ata gct gcg att gat gct gtc agg aca aaa ggg 768 Tyr Ala Glu Lys Met Ile Ala Ala Ile Asp Ala Val Arg Thr Lys Gly 245 250 255 aac tct gtt ggt ggt gtt gtg acc tgc att gtt cgg aat gct cca cgt 816 Asn Ser Val Gly Gly Val Val Thr Cys Ile Val Arg Asn Ala Pro Arg 260 265 270 ggg ctt ggt aca ccg gtt ttc gat aaa ctt gaa gca gaa ctg gca aaa 864 Gly Leu Gly Thr Pro Val Phe Asp Lys Leu Glu Ala Glu Leu Ala Lys 275 280 285 gct tgt atg tcg cta cct gca act aag gga ttt gag ttt gga agc ggc 912 Ala Cys Met Ser Leu Pro Ala Thr Lys Gly Phe Glu Phe Gly Ser Gly 290 295 300 ttt gca ggt acc ttt ttg act ggt ctt gaa cac aat gat gag ttc tat 960 Phe Ala Gly Thr Phe Leu Thr Gly Leu Glu His Asn Asp Glu Phe Tyr 305 310 315 320 acc gat gaa aat gga aga ata cgt acc aga acc aac cga tct ggt gga 1008 Thr Asp Glu Asn Gly Arg Ile Arg Thr Arg Thr Asn Arg Ser Gly Gly 325 330 335 att cag gga ggg atc tca aat ggt gaa ata ata aac atg aga gta gcc 1056 Ile Gln Gly Gly Ile Ser Asn Gly Glu Ile Ile Asn Met Arg Val Ala 340 345 350 ttc aag cca aca tca aca att gga agg aag caa aat acg gta acc aga 1104 Phe Lys Pro Thr Ser Thr Ile Gly Arg Lys Gln Asn Thr Val Thr Arg 355 360 365 gac aag gta gaa acc gaa atg att gcg cgt ggt cgt cat gat cct tgt 1152 Asp Lys Val Glu Thr Glu Met Ile Ala Arg Gly Arg His Asp Pro Cys 370 375 380 gtt gtt cca cga gct gtg cca atg gtg gaa gca atg gtg gct cta gtt 1200 Val Val Pro Arg Ala Val Pro Met Val Glu Ala Met Val Ala Leu Val 385 390 395 400 ctt gtg gat caa ttg atg gcg caa tac gca caa tgc cat ttg ttt cca 1248 Leu Val Asp Gln Leu Met Ala Gln Tyr Ala Gln Cys His Leu Phe Pro 405 410 415 ata aat cca gag ttg cag gaa cct ctc cag ata gag cag 
ccg caa aat 1296 Ile Asn Pro Glu Leu Gln Glu Pro Leu Gln Ile Glu Gln Pro Gln Asn 420 425 430 gct act gct ttg taa 1311 Ala Thr Ala Leu 435 28 436 PRT Arabidopsis thaliana 28 Met Ala Ser Ser Ser Leu Thr Ser Lys Ser Ile Leu Gly Ser Thr Lys 1 5 10 15 Leu Gly Ser Ser Ser Leu Pro Ser Glu Leu Arg Arg Leu Ser Ser Pro 20 25 30 Ala Val Gln Ile Ser Leu Arg Thr Gln Thr Arg Lys Asn Phe Gln Ile 35 40 45 Gln Ala Thr Gly Ser Ser Tyr Gly Thr His Phe Arg Val Ser Thr Phe 50 55 60 Gly Glu Ser His Gly Gly Gly Val Gly Cys Ile Ile Asp Gly Cys Pro 65 70 75 80 Pro Arg Ile Pro Leu Thr Glu Ser Asp Leu Gln Phe Asp Leu Asp Arg 85 90 95 Arg Arg Pro Gly Gln Ser Arg Ile Thr Thr Pro Arg Lys Glu Thr Asp 100 105 110 Thr Cys Arg Ile Ser Ser Gly Val Ser Glu Gly Met Thr Thr Gly Thr 115 120 125 Pro Ile His Val Phe Val Pro Asn Thr Asp Gln Arg Gly Leu Asp Tyr 130 135 140 Ser Glu Met Ser Val Ala Tyr Arg Pro Ser His Ala Asp Ala Thr Tyr 145 150 155 160 Asp Met Lys Tyr Gly Val Arg Ser Val Gln Gly Gly Gly Arg Ser Ser 165 170 175 Ala Arg Glu Thr Ile Gly Arg Val Ala Pro Gly Ala Leu Ala Lys Lys 180 185 190 Ile Leu Lys Gln Phe Ala Gly Thr Glu Ile Leu Ala Tyr Val Ser Gln 195 200 205 Val His His Val Val Leu Pro Glu Glu Leu Val Asp His Glu Asn Leu 210 215 220 Thr Leu Glu Gln Ile Glu Asn Asn Ile Val Arg Cys Pro Asn Pro Glu 225 230 235 240 Tyr Ala Glu Lys Met Ile Ala Ala Ile Asp Ala Val Arg Thr Lys Gly 245 250 255 Asn Ser Val Gly Gly Val Val Thr Cys Ile Val Arg Asn Ala Pro Arg 260 265 270 Gly Leu Gly Thr Pro Val Phe Asp Lys Leu Glu Ala Glu Leu Ala Lys 275 280 285 Ala Cys Met Ser Leu Pro Ala Thr Lys Gly Phe Glu Phe Gly Ser Gly 290 295 300 Phe Ala Gly Thr Phe Leu Thr Gly Leu Glu His Asn Asp Glu Phe Tyr 305 310 315 320 Thr Asp Glu Asn Gly Arg Ile Arg Thr Arg Thr Asn Arg Ser Gly Gly 325 330 335 Ile Gln Gly Gly Ile Ser Asn Gly Glu Ile Ile Asn Met Arg Val Ala 340 345 350 Phe Lys Pro Thr Ser Thr Ile Gly Arg Lys Gln Asn Thr Val Thr Arg 355 360 365 Asp Lys Val Glu Thr Glu Met Ile Ala Arg Gly Arg His Asp Pro Cys 370 375 380 Val 
Val Pro Arg Ala Val Pro Met Val Glu Ala Met Val Ala Leu Val 385 390 395 400 Leu Val Asp Gln Leu Met Ala Gln Tyr Ala Gln Cys His Leu Phe Pro 405 410 415 Ile Asn Pro Glu Leu Gln Glu Pro Leu Gln Ile Glu Gln Pro Gln Asn 420 425 430 Ala Thr Ala Leu 435 29 1038 DNA Arabidopsis thaliana CDS (1)..(1038) 29 atg ctt tcg tta tca tgc tct tct tct tct tct tcg ttg ctt cct ccg 48 Met Leu Ser Leu Ser Cys Ser Ser Ser Ser Ser Ser Leu Leu Pro Pro 1 5 10 15 agt tta cac tac cac ggt tct tct tct gtt cag tcc atc gtt gta cca 96 Ser Leu His Tyr His Gly Ser Ser Ser Val Gln Ser Ile Val Val Pro 20 25 30 aga agg agt ctt atc tcg ttt cgt cgg aaa gtc tct tgc tgt tgc ata 144 Arg Arg Ser Leu Ile Ser Phe Arg Arg Lys Val Ser Cys Cys Cys Ile 35 40 45 gct cca cct cag aac ttg gac aac gat gcc acc aaa ttc gat agt ctt 192 Ala Pro Pro Gln Asn Leu Asp Asn Asp Ala Thr Lys Phe Asp Ser Leu 50 55 60 acc aag tct gga gga ggt atg tgt aaa gag cga ggg ctt gag aat gat 240 Thr Lys Ser Gly Gly Gly Met Cys Lys Glu Arg Gly Leu Glu Asn Asp 65 70 75 80 tct gat gtt ctt att gaa tgt aga gat gtc tat aaa tcg ttt ggg gag 288 Ser Asp Val Leu Ile Glu Cys Arg Asp Val Tyr Lys Ser Phe Gly Glu 85 90 95 aaa cat atc ttg aaa ggt gtt agc ttt aag att aga cat ggt gaa gct 336 Lys His Ile Leu Lys Gly Val Ser Phe Lys Ile Arg His Gly Glu Ala 100 105 110 gtt ggg gtg att ggt cct tct gga act gga aaa tca aca att tta aag 384 Val Gly Val Ile Gly Pro Ser Gly Thr Gly Lys Ser Thr Ile Leu Lys 115 120 125 att atg gct ggt ctt ctt gct cca gac aag gga gaa gtt tat ata cga 432 Ile Met Ala Gly Leu Leu Ala Pro Asp Lys Gly Glu Val Tyr Ile Arg 130 135 140 gga aaa aaa cga gct ggt ttg ata agt gat gag gaa ata tca gga ctt 480 Gly Lys Lys Arg Ala Gly Leu Ile Ser Asp Glu Glu Ile Ser Gly Leu 145 150 155 160 cgt att ggc ctg gta ttt cag agt gca gct ctc ttt gat tca cta tca 528 Arg Ile Gly Leu Val Phe Gln Ser Ala Ala Leu Phe Asp Ser Leu Ser 165 170 175 gtt cgt gaa aat gtt ggt ttt cta ctt tat gaa aga tca aaa atg tcc 576 Val Arg Glu Asn Val Gly Phe Leu Leu Tyr Glu Arg Ser Lys Met 
Ser 180 185 190 gag aat caa ata tct gag ctt gtg aca caa acc ttg gca gct gtt ggt 624 Glu Asn Gln Ile Ser Glu Leu Val Thr Gln Thr Leu Ala Ala Val Gly 195 200 205 ttg aag ggg gtt gag aat cga tta cct tct gag cta tct ggt gga atg 672 Leu Lys Gly Val Glu Asn Arg Leu Pro Ser Glu Leu Ser Gly Gly Met 210 215 220 aag aaa agg gtt gct tta gct cgt tca cta att ttt gat aca aca aaa 720 Lys Lys Arg Val Ala Leu Ala Arg Ser Leu Ile Phe Asp Thr Thr Lys 225 230 235 240 gag gtc ata gag cca gag gtg ctt ttg tac gat gag cca act gct gga 768 Glu Val Ile Glu Pro Glu Val Leu Leu Tyr Asp Glu Pro Thr Ala Gly 245 250 255 ctt gat cca att gca tca act gta gtt gaa gat ctt ata cgg tct gtt 816 Leu Asp Pro Ile Ala Ser Thr Val Val Glu Asp Leu Ile Arg Ser Val 260 265 270 cac atg aca gac gaa gat gca gtt gga aaa cct gga aaa att gcg tct 864 His Met Thr Asp Glu Asp Ala Val Gly Lys Pro Gly Lys Ile Ala Ser 275 280 285 tat ctt gtt gtt acc cat caa cat agc acc att caa aga gct gta gac 912 Tyr Leu Val Val Thr His Gln His Ser Thr Ile Gln Arg Ala Val Asp 290 295 300 agg tta ttg ttt ctg tat gaa gga aag atc gtt tgg caa gga atg aca 960 Arg Leu Leu Phe Leu Tyr Glu Gly Lys Ile Val Trp Gln Gly Met Thr 305 310 315 320 cat gaa ttc aca acc tca act aat cca ata gtt caa cag ttt gct aca 1008 His Glu Phe Thr Thr Ser Thr Asn Pro Ile Val Gln Gln Phe Ala Thr 325 330 335 ggc agc ctc gat gga cca atc aga tac tag 1038 Gly Ser Leu Asp Gly Pro Ile Arg Tyr 340 345 30 345 PRT Arabidopsis thaliana 30 Met Leu Ser Leu Ser Cys Ser Ser Ser Ser Ser Ser Leu Leu Pro Pro 1 5 10 15 Ser Leu His Tyr His Gly Ser Ser Ser Val Gln Ser Ile Val Val Pro 20 25 30 Arg Arg Ser Leu Ile Ser Phe Arg Arg Lys Val Ser Cys Cys Cys Ile 35 40 45 Ala Pro Pro Gln Asn Leu Asp Asn Asp Ala Thr Lys Phe Asp Ser Leu 50 55 60 Thr Lys Ser Gly Gly Gly Met Cys Lys Glu Arg Gly Leu Glu Asn Asp 65 70 75 80 Ser Asp Val Leu Ile Glu Cys Arg Asp Val Tyr Lys Ser Phe Gly Glu 85 90 95 Lys His Ile Leu Lys Gly Val Ser Phe Lys Ile Arg His Gly Glu Ala 100 105 110 Val Gly Val Ile Gly Pro Ser Gly Thr 
Gly Lys Ser Thr Ile Leu Lys 115 120 125 Ile Met Ala Gly Leu Leu Ala Pro Asp Lys Gly Glu Val Tyr Ile Arg 130 135 140 Gly Lys Lys Arg Ala Gly Leu Ile Ser Asp Glu Glu Ile Ser Gly Leu 145 150 155 160 Arg Ile Gly Leu Val Phe Gln Ser Ala Ala Leu Phe Asp Ser Leu Ser 165 170 175 Val Arg Glu Asn Val Gly Phe Leu Leu Tyr Glu Arg Ser Lys Met Ser 180 185 190 Glu Asn Gln Ile Ser Glu Leu Val Thr Gln Thr Leu Ala Ala Val Gly 195 200 205 Leu Lys Gly Val Glu Asn Arg Leu Pro Ser Glu Leu Ser Gly Gly Met 210 215 220 Lys Lys Arg Val Ala Leu Ala Arg Ser Leu Ile Phe Asp Thr Thr Lys 225 230 235 240 Glu Val Ile Glu Pro Glu Val Leu Leu Tyr Asp Glu Pro Thr Ala Gly 245 250 255 Leu Asp Pro Ile Ala Ser Thr Val Val Glu Asp Leu Ile Arg Ser Val 260 265 270 His Met Thr Asp Glu Asp Ala Val Gly Lys Pro Gly Lys Ile Ala Ser 275 280 285 Tyr Leu Val Val Thr His Gln His Ser Thr Ile Gln Arg Ala Val Asp 290 295 300 Arg Leu Leu Phe Leu Tyr Glu Gly Lys Ile Val Trp Gln Gly Met Thr 305 310 315 320 His Glu Phe Thr Thr Ser Thr Asn Pro Ile Val Gln Gln Phe Ala Thr 325 330 335 Gly Ser Leu Asp Gly Pro Ile Arg Tyr 340 345 31 4035 DNA Arabidopsis thaliana CDS (1)..(4035) 31 atg gcg tct tct cct ggt aat act aac cct cac aat act cct ccg ttt 48 Met Ala Ser Ser Pro Gly Asn Thr Asn Pro His Asn Thr Pro Pro Phe 1 5 10 15 gat ctc ggt att ctc ttc aaa cct tca tcc aat cct tat ccg cca ccg 96 Asp Leu Gly Ile Leu Phe Lys Pro Ser Ser Asn Pro Tyr Pro Pro Pro 20 25 30 gcg gca tct tat cct ccg cct aca ggt cct ttt ctt cac aac cag tac 144 Ala Ala Ser Tyr Pro Pro Pro Thr Gly Pro Phe Leu His Asn Gln Tyr 35 40 45 gat cag cag cac tac gct ccg cct ggt atc tcc gct caa cca tct ccg 192 Asp Gln Gln His Tyr Ala Pro Pro Gly Ile Ser Ala Gln Pro Ser Pro 50 55 60 gtg act cag cag caa cag gat gtg tct tcc tct tcc gcc gcg act aac 240 Val Thr Gln Gln Gln Gln Asp Val Ser Ser Ser Ser Ala Ala Thr Asn 65 70 75 80 ttg cat cct cag aga acg ctg tct tat ccc acg cca cct ctc aat cta 288 Leu His Pro Gln Arg Thr Leu Ser Tyr Pro Thr Pro Pro Leu Asn Leu 85 90 95 caa tcc ccc cgt 
tcc aat cac aat ccc ggt acg cac atc ctc gct ctc 336 Gln Ser Pro Arg Ser Asn His Asn Pro Gly Thr His Ile Leu Ala Leu 100 105 110 ctt aac aat acc aac aac gga gcc ccc gtg gct aac caa gag ccg tcg 384 Leu Asn Asn Thr Asn Asn Gly Ala Pro Val Ala Asn Gln Glu Pro Ser 115 120 125 cat cag ctc cca gta gtc aat cac aac gag atc gct cgc tct ttt ccc 432 His Gln Leu Pro Val Val Asn His Asn Glu Ile Ala Arg Ser Phe Pro 130 135 140 ggc ggt tca ggt cct att cgt gta ccg agc tgt aaa ttg ccc aag gga 480 Gly Gly Ser Gly Pro Ile Arg Val Pro Ser Cys Lys Leu Pro Lys Gly 145 150 155 160 agg cga ttg att ggt gaa cac gct gtg tac gat gtt gat gtg aga tta 528 Arg Arg Leu Ile Gly Glu His Ala Val Tyr Asp Val Asp Val Arg Leu 165 170 175 caa ggt gag att cag ccg cag ctg gag gtg act ccg att acc aaa tac 576 Gln Gly Glu Ile Gln Pro Gln Leu Glu Val Thr Pro Ile Thr Lys Tyr 180 185 190 ggg tcg gat cct cag ctc gta gtg ggt agg caa atc gcc gtg aat aag 624 Gly Ser Asp Pro Gln Leu Val Val Gly Arg Gln Ile Ala Val Asn Lys 195 200 205 gtt tac att tgc tat gga ttg aaa gga gga aac att agg gtt ctc aat 672 Val Tyr Ile Cys Tyr Gly Leu Lys Gly Gly Asn Ile Arg Val Leu Asn 210 215 220 atc aac aca gca ttg agg tct ctg ttc cgt ggc cat tca cag aga gtg 720 Ile Asn Thr Ala Leu Arg Ser Leu Phe Arg Gly His Ser Gln Arg Val 225 230 235 240 acg gat atg gct ttc ttt gcc gag gat gtt gat atg ttg gcc agt gtt 768 Thr Asp Met Ala Phe Phe Ala Glu Asp Val Asp Met Leu Ala Ser Val 245 250 255 agc cta gat gga aaa gtt ttt gtg tgg aaa att tct gaa ggg tct gag 816 Ser Leu Asp Gly Lys Val Phe Val Trp Lys Ile Ser Glu Gly Ser Glu 260 265 270 gga gag gat cag ccc caa ata act ggg aag ata gta ctt gct ctt cag 864 Gly Glu Asp Gln Pro Gln Ile Thr Gly Lys Ile Val Leu Ala Leu Gln 275 280 285 ata cta gga gag gaa gac acc aaa cat cca cgc gtt tgt tgg cac tgc 912 Ile Leu Gly Glu Glu Asp Thr Lys His Pro Arg Val Cys Trp His Cys 290 295 300 cac aaa cag gaa att ttg gta gtt tca att ggt aaa cat gta ctg cgt 960 His Lys Gln Glu Ile Leu Val Val Ser Ile Gly Lys His Val Leu Arg 305 
310 315 320 att gat act aca aaa gtt ggc aga ggt gaa gta ttc tct gct gag gct 1008 Ile Asp Thr Thr Lys Val Gly Arg Gly Glu Val Phe Ser Ala Glu Ala 325 330 335 cct ctc cag tgt ccc ctt gat aag ctg atc gat ggt gtt cag att gtt 1056 Pro Leu Gln Cys Pro Leu Asp Lys Leu Ile Asp Gly Val Gln Ile Val 340 345 350 ggt aag cat gat gga gaa gtg aca gat ttg tca atg tgc caa tgg atg 1104 Gly Lys His Asp Gly Glu Val Thr Asp Leu Ser Met Cys Gln Trp Met 355 360 365 acc acg cgt ctg gtt tct tct tca gtt gat ggc acg att aag ata tgg 1152 Thr Thr Arg Leu Val Ser Ser Ser Val Asp Gly Thr Ile Lys Ile Trp 370 375 380 caa gat cgt aag gca caa cca ctt gta gtt ttg agg cct cat gat gga 1200 Gln Asp Arg Lys Ala Gln Pro Leu Val Val Leu Arg Pro His Asp Gly 385 390 395 400 cat cca gtc agt tca gcc acg ttt gtg aca tct cct gag aga cct gat 1248 His Pro Val Ser Ser Ala Thr Phe Val Thr Ser Pro Glu Arg Pro Asp 405 410 415 cac atc ata ctt atc acg ggg ggt cct cta aat cga gaa atg aaa atc 1296 His Ile Ile Leu Ile Thr Gly Gly Pro Leu Asn Arg Glu Met Lys Ile 420 425 430 tgg gtc tct gct ggg gaa gaa ggg tgg ctt tta cca gct gat gct gaa 1344 Trp Val Ser Ala Gly Glu Glu Gly Trp Leu Leu Pro Ala Asp Ala Glu 435 440 445 tca tgg agg tgt acc cag act ctt gac ttg aaa agt tca acc gag cca 1392 Ser Trp Arg Cys Thr Gln Thr Leu Asp Leu Lys Ser Ser Thr Glu Pro 450 455 460 cga gct gaa gag gca ttt ttt aac caa gtc ata gca tta tct gaa gca 1440 Arg Ala Glu Glu Ala Phe Phe Asn Gln Val Ile Ala Leu Ser Glu Ala 465 470 475 480 ggc ttg ctt tta ctt gca aat gca aaa agg aac gcc tta tat gct gtg 1488 Gly Leu Leu Leu Leu Ala Asn Ala Lys Arg Asn Ala Leu Tyr Ala Val 485 490 495 cat ttg gac tat ggt tct tct cca gtt ggt acg cgg atg gat tac cta 1536 His Leu Asp Tyr Gly Ser Ser Pro Val Gly Thr Arg Met Asp Tyr Leu 500 505 510 tca gag ttt aca gtc act atg ccc ata ttg agt ttc att ggg aca aat 1584 Ser Glu Phe Thr Val Thr Met Pro Ile Leu Ser Phe Ile Gly Thr Asn 515 520 525 gat cct cca gaa gaa ccc att gtt aag gtt tat tgt gtt cag act cta 1632 Asp Pro Pro Glu Glu Pro 
Ile Val Lys Val Tyr Cys Val Gln Thr Leu 530 535 540 gca atc cag cag tat aca tta gac ttg tgc tta tgc ttg cca cca cct 1680 Ala Ile Gln Gln Tyr Thr Leu Asp Leu Cys Leu Cys Leu Pro Pro Pro 545 550 555 560 ata gaa aat atg ggt ttg gag aag tca gat tct agt gta tca cga gag 1728 Ile Glu Asn Met Gly Leu Glu Lys Ser Asp Ser Ser Val Ser Arg Glu 565 570 575 gca aat ctt gtt gaa ggc atg tca gaa cca tct gga ctt aaa cct act 1776 Ala Asn Leu Val Glu Gly Met Ser Glu Pro Ser Gly Leu Lys Pro Thr 580 585 590 gac tta cct tca gtt gat tca gtg cca aaa cct tcg att ata gtg aat 1824 Asp Leu Pro Ser Val Asp Ser Val Pro Lys Pro Ser Ile Ile Val Asn 595 600 605 aga tca gaa agt gct aat aag ttg agt ttt cca tca gcg gaa gcc aca 1872 Arg Ser Glu Ser Ala Asn Lys Leu Ser Phe Pro Ser Ala Glu Ala Thr 610 615 620 tca caa gca att gtt cca ccc aac ggt gaa cct aaa act tct ggc ctg 1920 Ser Gln Ala Ile Val Pro Pro Asn Gly Glu Pro Lys Thr Ser Gly Leu 625 630 635 640 cca tct caa acc agt ggt gca ggt tct gcg tat gct act tta ccc caa 1968 Pro Ser Gln Thr Ser Gly Ala Gly Ser Ala Tyr Ala Thr Leu Pro Gln 645 650 655 ctt cct ctc agt ccc aga cta tca agt aaa ctt tct ggc tat cac act 2016 Leu Pro Leu Ser Pro Arg Leu Ser Ser Lys Leu Ser Gly Tyr His Thr 660 665 670 cct gta gag gcg att gag cca gta ata cct cat cat gag ctt ggt ggc 2064 Pro Val Glu Ala Ile Glu Pro Val Ile Pro His His Glu Leu Gly Gly 675 680 685 aaa aca cct tct gct gat tac tct gtt gat agg caa atg gat gct gtt 2112 Lys Thr Pro Ser Ala Asp Tyr Ser Val Asp Arg Gln Met Asp Ala Val 690 695 700 gga gaa aga aat ttg gac gtg tca tcc gta gaa gaa atc tca aga agc 2160 Gly Glu Arg Asn Leu Asp Val Ser Ser Val Glu Glu Ile Ser Arg Ser 705 710 715 720 aag gac tca aat gtt acg cct gat gat gat gtg tct ggg atg cga agc 2208 Lys Asp Ser Asn Val Thr Pro Asp Asp Asp Val Ser Gly Met Arg Ser 725 730 735 cca tca gct ttt ttc aaa cac ccc act cat ctt gta act cct tca gag 2256 Pro Ser Ala Phe Phe Lys His Pro Thr His Leu Val Thr Pro Ser Glu 740 745 750 ata ttg atg ggc gtt tct tcc gct gaa gcc tcc att 
act act gaa gac 2304 Ile Leu Met Gly Val Ser Ser Ala Glu Ala Ser Ile Thr Thr Glu Asp 755 760 765 agg aga gat agg gat gca aat att cag gac gtg aat aat gat cca aga 2352 Arg Arg Asp Arg Asp Ala Asn Ile Gln Asp Val Asn Asn Asp Pro Arg 770 775 780 gac acc gaa gta gag gtg aaa gaa ata agt gaa gca agg tca aca cag 2400 Asp Thr Glu Val Glu Val Lys Glu Ile Ser Glu Ala Arg Ser Thr Gln 785 790 795 800 aat ggt gaa atc aat gat cat gac gaa act gag aat tgc act tca gaa 2448 Asn Gly Glu Ile Asn Asp His Asp Glu Thr Glu Asn Cys Thr Ser Glu 805 810 815 aat aga gaa aaa gtc ttc tgc tca cag gtt tca aat ctc agc act gag 2496 Asn Arg Glu Lys Val Phe Cys Ser Gln Val Ser Asn Leu Ser Thr Glu 820 825 830 atg gca aga gac tgt tat cct agt act gag gga act ttc att cca ggg 2544 Met Ala Arg Asp Cys Tyr Pro Ser Thr Glu Gly Thr Phe Ile Pro Gly 835 840 845 gaa tct aag gca tat gga caa cct ata aag gct gga gat gaa agt ggt 2592 Glu Ser Lys Ala Tyr Gly Gln Pro Ile Lys Ala Gly Asp Glu Ser Gly 850 855 860 gtt gac tcg aga ggt ggg cct gca aag ctt ctt aaa gga aag aag cag 2640 Val Asp Ser Arg Gly Gly Pro Ala Lys Leu Leu Lys Gly Lys Lys Gln 865 870 875 880 aag gcc aaa aat tca cag ggt ccc ggt ttg tca tct aca tcc tca aat 2688 Lys Ala Lys Asn Ser Gln Gly Pro Gly Leu Ser Ser Thr Ser Ser Asn 885 890 895 gtt gct aat ctg gct gac tcc ttc aat gag caa agt cag agt tta agt 2736 Val Ala Asn Leu Ala Asp Ser Phe Asn Glu Gln Ser Gln Ser Leu Ser 900 905 910 cat ccc atg aca gat tta ctt cct cag ttg tta gca atg caa gaa acg 2784 His Pro Met Thr Asp Leu Leu Pro Gln Leu Leu Ala Met Gln Glu Thr 915 920 925 atg aat cag gtg atg gct tcg cag aag gag atg cag aga caa cta tca 2832 Met Asn Gln Val Met Ala Ser Gln Lys Glu Met Gln Arg Gln Leu Ser 930 935 940 aat gct gcc act ggc cct atc gga aaa gaa agt aaa aga cta gaa gtt 2880 Asn Ala Ala Thr Gly Pro Ile Gly Lys Glu Ser Lys Arg Leu Glu Val 945 950 955 960 gcg tta ggg aga atg att gag aaa tcc agc aag tca aat gct gat gct 2928 Ala Leu Gly Arg Met Ile Glu Lys Ser Ser Lys Ser Asn Ala Asp Ala 965 970 975 cta 
tgg gcc cgc atc cag gag gag act gtt aag aat gaa aag gca tta 2976 Leu Trp Ala Arg Ile Gln Glu Glu Thr Val Lys Asn Glu Lys Ala Leu 980 985 990 cgt gac cat gcc cag caa att gtg aat gca acg aca aac ttc atg agc 3024 Arg Asp His Ala Gln Gln Ile Val Asn Ala Thr Thr Asn Phe Met Ser 995 1000 1005 aag gag tta aat gcc atg ttt gag aaa acg ata aag aaa gaa ttg 3069 Lys Glu Leu Asn Ala Met Phe Glu Lys Thr Ile Lys Lys Glu Leu 1010 1015 1020 gct gca att ggt cca gcc cta gca cgt tca gta gta cca gtt att 3114 Ala Ala Ile Gly Pro Ala Leu Ala Arg Ser Val Val Pro Val Ile 1025 1030 1035 gaa aaa act gta tct tct gca atc aca gag tcc ttt cag aga gga 3159 Glu Lys Thr Val Ser Ser Ala Ile Thr Glu Ser Phe Gln Arg Gly 1040 1045 1050 ata ggt gac aaa gca gtt aat cag ctt gac aaa tct gtt aat ata 3204 Ile Gly Asp Lys Ala Val Asn Gln Leu Asp Lys Ser Val Asn Ile 1055 1060 1065 aag ctt gaa gca acc gta gct agg caa att caa gcc caa ttt cag 3249 Lys Leu Glu Ala Thr Val Ala Arg Gln Ile Gln Ala Gln Phe Gln 1070 1075 1080 acc tct ggc aag caa gcc ctc cag gaa ggt ctt agg tca agt gtg 3294 Thr Ser Gly Lys Gln Ala Leu Gln Glu Gly Leu Arg Ser Ser Val 1085 1090 1095 gag tcc tca gtc ata cct tcc ttt gag aag gca tgc aag gcc atg 3339 Glu Ser Ser Val Ile Pro Ser Phe Glu Lys Ala Cys Lys Ala Met 1100 1105 1110 ttt gac caa ata gac tca gcc ttc cag aaa ggt att gct gag cat 3384 Phe Asp Gln Ile Asp Ser Ala Phe Gln Lys Gly Ile Ala Glu His 1115 1120 1125 aca aat gca gcc cag caa cgg ttt gac tct gga cac tcc cag ctt 3429 Thr Asn Ala Ala Gln Gln Arg Phe Asp Ser Gly His Ser Gln Leu 1130 1135 1140 gct cat act cta aag gaa tca att act tct gcg tcg tca gtt gct 3474 Ala His Thr Leu Lys Glu Ser Ile Thr Ser Ala Ser Ser Val Ala 1145 1150 1155 caa gcc tta agt cgt gag tta gcc gag acc caa agg aat ctc tta 3519 Gln Ala Leu Ser Arg Glu Leu Ala Glu Thr Gln Arg Asn Leu Leu 1160 1165 1170 gct ctc gca gct gct gga gca aat tct ggt ggg tct aat tcc ttg 3564 Ala Leu Ala Ala Ala Gly Ala Asn Ser Gly Gly Ser Asn Ser Leu 1175 1180 1185 gtt act caa cta agt ggc gga cct 
ttg ggt gct ctt ctt gaa aag 3609 Val Thr Gln Leu Ser Gly Gly Pro Leu Gly Ala Leu Leu Glu Lys 1190 1195 1200 gtt gaa gca cct atg gac cca aca aca gaa cta tcg agg ttg ata 3654 Val Glu Ala Pro Met Asp Pro Thr Thr Glu Leu Ser Arg Leu Ile 1205 1210 1215 tct gaa cgg aag tac gaa gaa tct ttc act tcg gcc cta cag aga 3699 Ser Glu Arg Lys Tyr Glu Glu Ser Phe Thr Ser Ala Leu Gln Arg 1220 1225 1230 agc gat gtc tct ata gta tca tgg ctt tgc tca cag gtg gat cta 3744 Ser Asp Val Ser Ile Val Ser Trp Leu Cys Ser Gln Val Asp Leu 1235 1240 1245 cgt gga tta ctg gcg atg aat ccg ctt cca ctg agc caa ggc gtg 3789 Arg Gly Leu Leu Ala Met Asn Pro Leu Pro Leu Ser Gln Gly Val 1250 1255 1260 ctt ctt tca ctg ctg cag cag cta gcc tgt gac att agc aag gac 3834 Leu Leu Ser Leu Leu Gln Gln Leu Ala Cys Asp Ile Ser Lys Asp 1265 1270 1275 aca tcc cgt aag ctg gct tgg atg act gat gtg gtt gca gcc ata 3879 Thr Ser Arg Lys Leu Ala Trp Met Thr Asp Val Val Ala Ala Ile 1280 1285 1290 aac cca tca gac cag atg att gcg gtc cac gct cgg cca atc ttt 3924 Asn Pro Ser Asp Gln Met Ile Ala Val His Ala Arg Pro Ile Phe 1295 1300 1305 gaa cag gta tat cag att ctg cac cat cac cgc aac gca ccg gga 3969 Glu Gln Val Tyr Gln Ile Leu His His His Arg Asn Ala Pro Gly 1310 1315 1320 agc gat gtc tca gcc atc aga ctg ata atg cac gtg atc aac tcc 4014 Ser Asp Val Ser Ala Ile Arg Leu Ile Met His Val Ile Asn Ser 1325 1330 1335 atg ctt atg ggt tgc aaa tga 4035 Met Leu Met Gly Cys Lys 1340 32 1344 PRT Arabidopsis thaliana 32 Met Ala Ser Ser Pro Gly Asn Thr Asn Pro His Asn Thr Pro Pro Phe 1 5 10 15 Asp Leu Gly Ile Leu Phe Lys Pro Ser Ser Asn Pro Tyr Pro Pro Pro 20 25 30 Ala Ala Ser Tyr Pro Pro Pro Thr Gly Pro Phe Leu His Asn Gln Tyr 35 40 45 Asp Gln Gln His Tyr Ala Pro Pro Gly Ile Ser Ala Gln Pro Ser Pro 50 55 60 Val Thr Gln Gln Gln Gln Asp Val Ser Ser Ser Ser Ala Ala Thr Asn 65 70 75 80 Leu His Pro Gln Arg Thr Leu Ser Tyr Pro Thr Pro Pro Leu Asn Leu 85 90 95 Gln Ser Pro Arg Ser Asn His Asn Pro Gly Thr His Ile Leu Ala Leu 100 105 110 Leu Asn Asn Thr 
Asn Asn Gly Ala Pro Val Ala Asn Gln Glu Pro Ser 115 120 125 His Gln Leu Pro Val Val Asn His Asn Glu Ile Ala Arg Ser Phe Pro 130 135 140 Gly Gly Ser Gly Pro Ile Arg Val Pro Ser Cys Lys Leu Pro Lys Gly 145 150 155 160 Arg Arg Leu Ile Gly Glu His Ala Val Tyr Asp Val Asp Val Arg Leu 165 170 175 Gln Gly Glu Ile Gln Pro Gln Leu Glu Val Thr Pro Ile Thr Lys Tyr 180 185 190 Gly Ser Asp Pro Gln Leu Val Val Gly Arg Gln Ile Ala Val Asn Lys 195 200 205 Val Tyr Ile Cys Tyr Gly Leu Lys Gly Gly Asn Ile Arg Val Leu Asn 210 215 220 Ile Asn Thr Ala Leu Arg Ser Leu Phe Arg Gly His Ser Gln Arg Val 225 230 235 240 Thr Asp Met Ala Phe Phe Ala Glu Asp Val Asp Met Leu Ala Ser Val 245 250 255 Ser Leu Asp Gly Lys Val Phe Val Trp Lys Ile Ser Glu Gly Ser Glu 260 265 270 Gly Glu Asp Gln Pro Gln Ile Thr Gly Lys Ile Val Leu Ala Leu Gln 275 280 285 Ile Leu Gly Glu Glu Asp Thr Lys His Pro Arg Val Cys Trp His Cys 290 295 300 His Lys Gln Glu Ile Leu Val Val Ser Ile Gly Lys His Val Leu Arg 305 310 315 320 Ile Asp Thr Thr Lys Val Gly Arg Gly Glu Val Phe Ser Ala Glu Ala 325 330 335 Pro Leu Gln Cys Pro Leu Asp Lys Leu Ile Asp Gly Val Gln Ile Val 340 345 350 Gly Lys His Asp Gly Glu Val Thr Asp Leu Ser Met Cys Gln Trp Met 355 360 365 Thr Thr Arg Leu Val Ser Ser Ser Val Asp Gly Thr Ile Lys Ile Trp 370 375 380 Gln Asp Arg Lys Ala Gln Pro Leu Val Val Leu Arg Pro His Asp Gly 385 390 395 400 His Pro Val Ser Ser Ala Thr Phe Val Thr Ser Pro Glu Arg Pro Asp 405 410 415 His Ile Ile Leu Ile Thr Gly Gly Pro Leu Asn Arg Glu Met Lys Ile 420 425 430 Trp Val Ser Ala Gly Glu Glu Gly Trp Leu Leu Pro Ala Asp Ala Glu 435 440 445 Ser Trp Arg Cys Thr Gln Thr Leu Asp Leu Lys Ser Ser Thr Glu Pro 450 455 460 Arg Ala Glu Glu Ala Phe Phe Asn Gln Val Ile Ala Leu Ser Glu Ala 465 470 475 480 Gly Leu Leu Leu Leu Ala Asn Ala Lys Arg Asn Ala Leu Tyr Ala Val 485 490 495 His Leu Asp Tyr Gly Ser Ser Pro Val Gly Thr Arg Met Asp Tyr Leu 500 505 510 Ser Glu Phe Thr Val Thr Met Pro Ile Leu Ser Phe Ile Gly Thr Asn 515 520 525 Asp Pro Pro Glu Glu 
Pro Ile Val Lys Val Tyr Cys Val Gln Thr Leu 530 535 540 Ala Ile Gln Gln Tyr Thr Leu Asp Leu Cys Leu Cys Leu Pro Pro Pro 545 550 555 560 Ile Glu Asn Met Gly Leu Glu Lys Ser Asp Ser Ser Val Ser Arg Glu 565 570 575 Ala Asn Leu Val Glu Gly Met Ser Glu Pro Ser Gly Leu Lys Pro Thr 580 585 590 Asp Leu Pro Ser Val Asp Ser Val Pro Lys Pro Ser Ile Ile Val Asn 595 600 605 Arg Ser Glu Ser Ala Asn Lys Leu Ser Phe Pro Ser Ala Glu Ala Thr 610 615 620 Ser Gln Ala Ile Val Pro Pro Asn Gly Glu Pro Lys Thr Ser Gly Leu 625 630 635 640 Pro Ser Gln Thr Ser Gly Ala Gly Ser Ala Tyr Ala Thr Leu Pro Gln 645 650 655 Leu Pro Leu Ser Pro Arg Leu Ser Ser Lys Leu Ser Gly Tyr His Thr 660 665 670 Pro Val Glu Ala Ile Glu Pro Val Ile Pro His His Glu Leu Gly Gly 675 680 685 Lys Thr Pro Ser Ala Asp Tyr Ser Val Asp Arg Gln Met Asp Ala Val 690 695 700 Gly Glu Arg Asn Leu Asp Val Ser Ser Val Glu Glu Ile Ser Arg Ser 705 710 715 720 Lys Asp Ser Asn Val Thr Pro Asp Asp Asp Val Ser Gly Met Arg Ser 725 730 735 Pro Ser Ala Phe Phe Lys His Pro Thr His Leu Val Thr Pro Ser Glu 740 745 750 Ile Leu Met Gly Val Ser Ser Ala Glu Ala Ser Ile Thr Thr Glu Asp 755 760 765 Arg Arg Asp Arg Asp Ala Asn Ile Gln Asp Val Asn Asn Asp Pro Arg 770 775 780 Asp Thr Glu Val Glu Val Lys Glu Ile Ser Glu Ala Arg Ser Thr Gln 785 790 795 800 Asn Gly Glu Ile Asn Asp His Asp Glu Thr Glu Asn Cys Thr Ser Glu 805 810 815 Asn Arg Glu Lys Val Phe Cys Ser Gln Val Ser Asn Leu Ser Thr Glu 820 825 830 Met Ala Arg Asp Cys Tyr Pro Ser Thr Glu Gly Thr Phe Ile Pro Gly 835 840 845 Glu Ser Lys Ala Tyr Gly Gln Pro Ile Lys Ala Gly Asp Glu Ser Gly 850 855 860 Val Asp Ser Arg Gly Gly Pro Ala Lys Leu Leu Lys Gly Lys Lys Gln 865 870 875 880 Lys Ala Lys Asn Ser Gln Gly Pro Gly Leu Ser Ser Thr Ser Ser Asn 885 890 895 Val Ala Asn Leu Ala Asp Ser Phe Asn Glu Gln Ser Gln Ser Leu Ser 900 905 910 His Pro Met Thr Asp Leu Leu Pro Gln Leu Leu Ala Met Gln Glu Thr 915 920 925 Met Asn Gln Val Met Ala Ser Gln Lys Glu Met Gln Arg Gln Leu Ser 930 935 940 Asn Ala Ala Thr Gly Pro 
Ile Gly Lys Glu Ser Lys Arg Leu Glu Val 945 950 955 960 Ala Leu Gly Arg Met Ile Glu Lys Ser Ser Lys Ser Asn Ala Asp Ala 965 970 975 Leu Trp Ala Arg Ile Gln Glu Glu Thr Val Lys Asn Glu Lys Ala Leu 980 985 990 Arg Asp His Ala Gln Gln Ile Val Asn Ala Thr Thr Asn Phe Met Ser 995 1000 1005 Lys Glu Leu Asn Ala Met Phe Glu Lys Thr Ile Lys Lys Glu Leu 1010 1015 1020 Ala Ala Ile Gly Pro Ala Leu Ala Arg Ser Val Val Pro Val Ile 1025 1030 1035 Glu Lys Thr Val Ser Ser Ala Ile Thr Glu Ser Phe Gln Arg Gly 1040 1045 1050 Ile Gly Asp Lys Ala Val Asn Gln Leu Asp Lys Ser Val Asn Ile 1055 1060 1065 Lys Leu Glu Ala Thr Val Ala Arg Gln Ile Gln Ala Gln Phe Gln 1070 1075 1080 Thr Ser Gly Lys Gln Ala Leu Gln Glu Gly Leu Arg Ser Ser Val 1085 1090 1095 Glu Ser Ser Val Ile Pro Ser Phe Glu Lys Ala Cys Lys Ala Met 1100 1105 1110 Phe Asp Gln Ile Asp Ser Ala Phe Gln Lys Gly Ile Ala Glu His 1115 1120 1125 Thr Asn Ala Ala Gln Gln Arg Phe Asp Ser Gly His Ser Gln Leu 1130 1135 1140 Ala His Thr Leu Lys Glu Ser Ile Thr Ser Ala Ser Ser Val Ala 1145 1150 1155 Gln Ala Leu Ser Arg Glu Leu Ala Glu Thr Gln Arg Asn Leu Leu 1160 1165 1170 Ala Leu Ala Ala Ala Gly Ala Asn Ser Gly Gly Ser Asn Ser Leu 1175 1180 1185 Val Thr Gln Leu Ser Gly Gly Pro Leu Gly Ala Leu Leu Glu Lys 1190 1195 1200 Val Glu Ala Pro Met Asp Pro Thr Thr Glu Leu Ser Arg Leu Ile 1205 1210 1215 Ser Glu Arg Lys Tyr Glu Glu Ser Phe Thr Ser Ala Leu Gln Arg 1220 1225 1230 Ser Asp Val Ser Ile Val Ser Trp Leu Cys Ser Gln Val Asp Leu 1235 1240 1245 Arg Gly Leu Leu Ala Met Asn Pro Leu Pro Leu Ser Gln Gly Val 1250 1255 1260 Leu Leu Ser Leu Leu Gln Gln Leu Ala Cys Asp Ile Ser Lys Asp 1265 1270 1275 Thr Ser Arg Lys Leu Ala Trp Met Thr Asp Val Val Ala Ala Ile 1280 1285 1290 Asn Pro Ser Asp Gln Met Ile Ala Val His Ala Arg Pro Ile Phe 1295 1300 1305 Glu Gln Val Tyr Gln Ile Leu His His His Arg Asn Ala Pro Gly 1310 1315 1320 Ser Asp Val Ser Ala Ile Arg Leu Ile Met His Val Ile Asn Ser 1325 1330 1335 Met Leu Met Gly Cys Lys 1340 33 3036 DNA Arabidopsis thaliana 
CDS (1)..(3036) 33 atg ttg ctt cca ctg ttt cac cag caa cca ttg ata ctt gcc aaa acc 48 Met Leu Leu Pro Leu Phe His Gln Gln Pro Leu Ile Leu Ala Lys Thr 1 5 10 15 ttt ccc gat cga att ttc ccg cca ttc ctt gtt ccc aac acc cta gtt 96 Phe Pro Asp Arg Ile Phe Pro Pro Phe Leu Val Pro Asn Thr Leu Val 20 25 30 tcg cga cgc aat gtc agt cga gcg aac tcc ggc atc ttc tgt agt tct 144 Ser Arg Arg Asn Val Ser Arg Ala Asn Ser Gly Ile Phe Cys Ser Ser 35 40 45 gct tcc ggt cga aag act ctt ccc caa tcg gct atc cag aga atc gct 192 Ala Ser Gly Arg Lys Thr Leu Pro Gln Ser Ala Ile Gln Arg Ile Ala 50 55 60 gag aag ctt cgt agc ctt gga ttc gtg gaa gag aag cat gat tca ccg 240 Glu Lys Leu Arg Ser Leu Gly Phe Val Glu Glu Lys His Asp Ser Pro 65 70 75 80 act agg aga atc acg gga gaa gaa tct ggt aag aat tcc cca gga gag 288 Thr Arg Arg Ile Thr Gly Glu Glu Ser Gly Lys Asn Ser Pro Gly Glu 85 90 95 ata ttc gtt ccg ttg cct aag cag ctt ccg att cac cgg gtc ggg cac 336 Ile Phe Val Pro Leu Pro Lys Gln Leu Pro Ile His Arg Val Gly His 100 105 110 acg att gac aca agc tgg agc acg cct agt tac ccg gtt ccg aag ccc 384 Thr Ile Asp Thr Ser Trp Ser Thr Pro Ser Tyr Pro Val Pro Lys Pro 115 120 125 ggg tcg ggt act gcc att tca agg tac cat gag ttg aag aga gtg tgg 432 Gly Ser Gly Thr Ala Ile Ser Arg Tyr His Glu Leu Lys Arg Val Trp 130 135 140 aag aag gaa acg gag atg gag agg aag aag gag gag aag gtt ccg tcg 480 Lys Lys Glu Thr Glu Met Glu Arg Lys Lys Glu Glu Lys Val Pro Ser 145 150 155 160 ttg gcg gaa ttg aca ttg ccg ccg gcg gag ctg agg cgg ttg agg acg 528 Leu Ala Glu Leu Thr Leu Pro Pro Ala Glu Leu Arg Arg Leu Arg Thr 165 170 175 gtg ggg att agg ttg acg aag aag ctg aag att ggc aag gct ggg att 576 Val Gly Ile Arg Leu Thr Lys Lys Leu Lys Ile Gly Lys Ala Gly Ile 180 185 190 acg gaa ggg att gtg aat ggg att cac gag aga tgg agg acg acg gag 624 Thr Glu Gly Ile Val Asn Gly Ile His Glu Arg Trp Arg Thr Thr Glu 195 200 205 gtt gtg aag att ttc tgc gaa gat att tcc aga atg aac atg aaa cgg 672 Val Val Lys Ile Phe Cys Glu Asp Ile Ser Arg Met Asn 
Met Lys Arg 210 215 220 acc cat gac gtc ttg gag aca aaa act gga ggc tta gtc att tgg agg 720 Thr His Asp Val Leu Glu Thr Lys Thr Gly Gly Leu Val Ile Trp Arg 225 230 235 240 tct gga agc aaa ata ttg ttg tat aga ggg gta aat tat cag tac cct 768 Ser Gly Ser Lys Ile Leu Leu Tyr Arg Gly Val Asn Tyr Gln Tyr Pro 245 250 255 tat ttt gta tct gat cgg gat ttg gca cat gaa gct gct tca gga gca 816 Tyr Phe Val Ser Asp Arg Asp Leu Ala His Glu Ala Ala Ser Gly Ala 260 265 270 tca tct atg gat caa gga gtg gtt gat agt aga gag aaa cag agt att 864 Ser Ser Met Asp Gln Gly Val Val Asp Ser Arg Glu Lys Gln Ser Ile 275 280 285 gca gaa tct tct gcc ccg agt att act aat aaa atg gtt aaa ccg atg 912 Ala Glu Ser Ser Ala Pro Ser Ile Thr Asn Lys Met Val Lys Pro Met 290 295 300 ttg acg caa ggc gtt ggt tct ccg gat aag gtt agg ttt caa tta ccc 960 Leu Thr Gln Gly Val Gly Ser Pro Asp Lys Val Arg Phe Gln Leu Pro 305 310 315 320 ggg gaa gta cag ctt gta gaa gaa gca gac cgt ttg ttg gaa gga ttg 1008 Gly Glu Val Gln Leu Val Glu Glu Ala Asp Arg Leu Leu Glu Gly Leu 325 330 335 ggt cca aga ttt act gac tgg tgg gca tat gat ccc ctt ccc gta gat 1056 Gly Pro Arg Phe Thr Asp Trp Trp Ala Tyr Asp Pro Leu Pro Val Asp 340 345 350 ggt gat ctt ctg cca gcc gta gta cct gac tac agg aga cca ttt cgc 1104 Gly Asp Leu Leu Pro Ala Val Val Pro Asp Tyr Arg Arg Pro Phe Arg 355 360 365 ctt ctt cca tat ggc gtc agt cca aaa cta act gat gat gaa atg acc 1152 Leu Leu Pro Tyr Gly Val Ser Pro Lys Leu Thr Asp Asp Glu Met Thr 370 375 380 aca ata agg aga ctt ggt aga cca ctc cct tgt cat ttt gct tta ggt 1200 Thr Ile Arg Arg Leu Gly Arg Pro Leu Pro Cys His Phe Ala Leu Gly 385 390 395 400 agg aat aga aat ttg cag gga cta gct gtt gca att gtc aag ctt tgg 1248 Arg Asn Arg Asn Leu Gln Gly Leu Ala Val Ala Ile Val Lys Leu Trp 405 410 415 gag aaa tgt gag ctt gcc aag ata gca gtg aag aga gga gtt cag aac 1296 Glu Lys Cys Glu Leu Ala Lys Ile Ala Val Lys Arg Gly Val Gln Asn 420 425 430 act aat agt gag ctg atg gcc gaa gag tta aag tgg ttg act gga ggg 1344 Thr Asn Ser Glu 
Leu Met Ala Glu Glu Leu Lys Trp Leu Thr Gly Gly 435 440 445 act ctg ata tct cgg gat aaa gat ttc att gtc ctg tac aga gga aag 1392 Thr Leu Ile Ser Arg Asp Lys Asp Phe Ile Val Leu Tyr Arg Gly Lys 450 455 460 gat ttc ctc cca tct gca gtt tct tct gca ata gaa gag agg aga agg 1440 Asp Phe Leu Pro Ser Ala Val Ser Ser Ala Ile Glu Glu Arg Arg Arg 465 470 475 480 caa aca atg ata atg gag aat tct agt gtg cat ggt aat aag ctg act 1488 Gln Thr Met Ile Met Glu Asn Ser Ser Val His Gly Asn Lys Leu Thr 485 490 495 gaa aac gaa gag gaa ata aaa cct cgg gct gtt aaa gag gat att gaa 1536 Glu Asn Glu Glu Glu Ile Lys Pro Arg Ala Val Lys Glu Asp Ile Glu 500 505 510 cta gaa gct aag gat cag aag gat cat ata caa acc cat cag atg aaa 1584 Leu Glu Ala Lys Asp Gln Lys Asp His Ile Gln Thr His Gln Met Lys 515 520 525 tca aga cag cga aat tct cct gag gca atc ctt gaa aaa act agt atg 1632 Ser Arg Gln Arg Asn Ser Pro Glu Ala Ile Leu Glu Lys Thr Ser Met 530 535 540 aag ttg tcc atg gcg ttg gaa aag aaa gca aat gca gaa aaa gtc ctg 1680 Lys Leu Ser Met Ala Leu Glu Lys Lys Ala Asn Ala Glu Lys Val Leu 545 550 555 560 gcg gat ctg gag aat aga gag agt cct caa ctt tct gat att gac aag 1728 Ala Asp Leu Glu Asn Arg Glu Ser Pro Gln Leu Ser Asp Ile Asp Lys 565 570 575 gag ggt att act aat gat gaa aag tac atg ctc cgg aaa att ggc ttg 1776 Glu Gly Ile Thr Asn Asp Glu Lys Tyr Met Leu Arg Lys Ile Gly Leu 580 585 590 aaa atg aag cct ttc ctt tta cta ggt aga aga ggc gtc ttt gat gga 1824 Lys Met Lys Pro Phe Leu Leu Leu Gly Arg Arg Gly Val Phe Asp Gly 595 600 605 aca ata gag aac atg cat ctt cac tgg aag tat agg gaa ctt gtg aag 1872 Thr Ile Glu Asn Met His Leu His Trp Lys Tyr Arg Glu Leu Val Lys 610 615 620 atc att tgt aat gaa tat agc att gaa gct gct cac aaa gta gca gaa 1920 Ile Ile Cys Asn Glu Tyr Ser Ile Glu Ala Ala His Lys Val Ala Glu 625 630 635 640 atc ttg gaa gca gaa agt ggt ggc ata cta gtt gct gtg gag atg gtt 1968 Ile Leu Glu Ala Glu Ser Gly Gly Ile Leu Val Ala Val Glu Met Val 645 650 655 agt aag ggc tat gca atc att gtg tat cgt 
gga aag aat tat gag cgg 2016 Ser Lys Gly Tyr Ala Ile Ile Val Tyr Arg Gly Lys Asn Tyr Glu Arg 660 665 670 cct caa tgt cta cga cct caa aca ctt ctc agt aag aga gag gcg ctg 2064 Pro Gln Cys Leu Arg Pro Gln Thr Leu Leu Ser Lys Arg Glu Ala Leu 675 680 685 aag cgg tct gtg gag gca caa cgt cga aag tca ttg aag cta cat gtg 2112 Lys Arg Ser Val Glu Ala Gln Arg Arg Lys Ser Leu Lys Leu His Val 690 695 700 ctg aaa ctt tcc aac aat atc gaa gaa ttg aac cgt caa ttg gtc gag 2160 Leu Lys Leu Ser Asn Asn Ile Glu Glu Leu Asn Arg Gln Leu Val Glu 705 710 715 720 gac agc gca acc aat gaa act tgg tca gat ggg gaa tca agt aac atg 2208 Asp Ser Ala Thr Asn Glu Thr Trp Ser Asp Gly Glu Ser Ser Asn Met 725 730 735 atg gtc gaa gaa gaa aca gaa aat cag cat act gaa cct gag aaa gca 2256 Met Val Glu Glu Glu Thr Glu Asn Gln His Thr Glu Pro Glu Lys Ala 740 745 750 aga gaa aaa att gaa ctt ggc tat tca tct gac tta tca gtt cct tcc 2304 Arg Glu Lys Ile Glu Leu Gly Tyr Ser Ser Asp Leu Ser Val Pro Ser 755 760 765 tct ggt gaa gaa aac tgg gag gat gac agc gaa ggt gaa gta gac cca 2352 Ser Gly Glu Glu Asn Trp Glu Asp Asp Ser Glu Gly Glu Val Asp Pro 770 775 780 tta act acg agc agt cag gag tat caa gaa gat gaa tct gaa tct gcc 2400 Leu Thr Thr Ser Ser Gln Glu Tyr Gln Glu Asp Glu Ser Glu Ser Ala 785 790 795 800 tcg tct cag aga cac gag ggt aac tct ctc gat tca act gca aat ctg 2448 Ser Ser Gln Arg His Glu Gly Asn Ser Leu Asp Ser Thr Ala Asn Leu 805 810 815 tct gtg ttt gcg gaa acc ggt tcc gcc aat gct tct tca ttt cat gat 2496 Ser Val Phe Ala Glu Thr Gly Ser Ala Asn Ala Ser Ser Phe His Asp 820 825 830 aga tca ctg ccg cac aac agc ttc ctg aat gcc aat cga aag tta cca 2544 Arg Ser Leu Pro His Asn Ser Phe Leu Asn Ala Asn Arg Lys Leu Pro 835 840 845 gga tca tca act gga agt ggc tca caa atc tca gct ctg aga gag agg 2592 Gly Ser Ser Thr Gly Ser Gly Ser Gln Ile Ser Ala Leu Arg Glu Arg 850 855 860 aaa agc gaa aat gat ggg ttg gtc aca gat ctg tcc aat aga gaa agg 2640 Lys Ser Glu Asn Asp Gly Leu Val Thr Asp Leu Ser Asn Arg Glu Arg 865 870 875 
880 ctc att ctg aga aaa caa gcc ctt aag atg aaa aag cga cca ccc ttt 2688 Leu Ile Leu Arg Lys Gln Ala Leu Lys Met Lys Lys Arg Pro Pro Phe 885 890 895 gca gta gga aga agc aac gtt gtt aca ggt tta gca aga aca ctg aag 2736 Ala Val Gly Arg Ser Asn Val Val Thr Gly Leu Ala Arg Thr Leu Lys 900 905 910 atg cat ttt cag aaa aac cct cta gca atc gta aac gtc aaa gga aga 2784 Met His Phe Gln Lys Asn Pro Leu Ala Ile Val Asn Val Lys Gly Arg 915 920 925 gca aat gga aca tct gtg caa gaa gtc atc gca aag cta aag gaa gaa 2832 Ala Asn Gly Thr Ser Val Gln Glu Val Ile Ala Lys Leu Lys Glu Glu 930 935 940 aca gga gct ctt ctg gtg tca cag gag cca agt aaa gtc ata ctt tac 2880 Thr Gly Ala Leu Leu Val Ser Gln Glu Pro Ser Lys Val Ile Leu Tyr 945 950 955 960 cgc gga tgg gga gcg gaa gaa gaa atg aaa agc ttc tac cca aat aac 2928 Arg Gly Trp Gly Ala Glu Glu Glu Met Lys Ser Phe Tyr Pro Asn Asn 965 970 975 aat gtc aaa agc tct ata aat cta ccc tct acg aga agc ttt gtt gat 2976 Asn Val Lys Ser Ser Ile Asn Leu Pro Ser Thr Arg Ser Phe Val Asp 980 985 990 gat cct cct cat gtg tct cct gca ctc atc gaa gcc ata aga ctt gaa 3024 Asp Pro Pro His Val Ser Pro Ala Leu Ile Glu Ala Ile Arg Leu Glu 995 1000 1005 tgt ggt ttg taa 3036 Cys Gly Leu 1010 34 1011 PRT Arabidopsis thaliana 34 Met Leu Leu Pro Leu Phe His Gln Gln Pro Leu Ile Leu Ala Lys Thr 1 5 10 15 Phe Pro Asp Arg Ile Phe Pro Pro Phe Leu Val Pro Asn Thr Leu Val 20 25 30 Ser Arg Arg Asn Val Ser Arg Ala Asn Ser Gly Ile Phe Cys Ser Ser 35 40 45 Ala Ser Gly Arg Lys Thr Leu Pro Gln Ser Ala Ile Gln Arg Ile Ala 50 55 60 Glu Lys Leu Arg Ser Leu Gly Phe Val Glu Glu Lys His Asp Ser Pro 65 70 75 80 Thr Arg Arg Ile Thr Gly Glu Glu Ser Gly Lys Asn Ser Pro Gly Glu 85 90 95 Ile Phe Val Pro Leu Pro Lys Gln Leu Pro Ile His Arg Val Gly His 100 105 110 Thr Ile Asp Thr Ser Trp Ser Thr Pro Ser Tyr Pro Val Pro Lys Pro 115 120 125 Gly Ser Gly Thr Ala Ile Ser Arg Tyr His Glu Leu Lys Arg Val Trp 130 135 140 Lys Lys Glu Thr Glu Met Glu Arg Lys Lys Glu Glu Lys Val Pro Ser 145 150 155 160 Leu 
Ala Glu Leu Thr Leu Pro Pro Ala Glu Leu Arg Arg Leu Arg Thr 165 170 175 Val Gly Ile Arg Leu Thr Lys Lys Leu Lys Ile Gly Lys Ala Gly Ile 180 185 190 Thr Glu Gly Ile Val Asn Gly Ile His Glu Arg Trp Arg Thr Thr Glu 195 200 205 Val Val Lys Ile Phe Cys Glu Asp Ile Ser Arg Met Asn Met Lys Arg 210 215 220 Thr His Asp Val Leu Glu Thr Lys Thr Gly Gly Leu Val Ile Trp Arg 225 230 235 240 Ser Gly Ser Lys Ile Leu Leu Tyr Arg Gly Val Asn Tyr Gln Tyr Pro 245 250 255 Tyr Phe Val Ser Asp Arg Asp Leu Ala His Glu Ala Ala Ser Gly Ala 260 265 270 Ser Ser Met Asp Gln Gly Val Val Asp Ser Arg Glu Lys Gln Ser Ile 275 280 285 Ala Glu Ser Ser Ala Pro Ser Ile Thr Asn Lys Met Val Lys Pro Met 290 295 300 Leu Thr Gln Gly Val Gly Ser Pro Asp Lys Val Arg Phe Gln Leu Pro 305 310 315 320 Gly Glu Val Gln Leu Val Glu Glu Ala Asp Arg Leu Leu Glu Gly Leu 325 330 335 Gly Pro Arg Phe Thr Asp Trp Trp Ala Tyr Asp Pro Leu Pro Val Asp 340 345 350 Gly Asp Leu Leu Pro Ala Val Val Pro Asp Tyr Arg Arg Pro Phe Arg 355 360 365 Leu Leu Pro Tyr Gly Val Ser Pro Lys Leu Thr Asp Asp Glu Met Thr 370 375 380 Thr Ile Arg Arg Leu Gly Arg Pro Leu Pro Cys His Phe Ala Leu Gly 385 390 395 400 Arg Asn Arg Asn Leu Gln Gly Leu Ala Val Ala Ile Val Lys Leu Trp 405 410 415 Glu Lys Cys Glu Leu Ala Lys Ile Ala Val Lys Arg Gly Val Gln Asn 420 425 430 Thr Asn Ser Glu Leu Met Ala Glu Glu Leu Lys Trp Leu Thr Gly Gly 435 440 445 Thr Leu Ile Ser Arg Asp Lys Asp Phe Ile Val Leu Tyr Arg Gly Lys 450 455 460 Asp Phe Leu Pro Ser Ala Val Ser Ser Ala Ile Glu Glu Arg Arg Arg 465 470 475 480 Gln Thr Met Ile Met Glu Asn Ser Ser Val His Gly Asn Lys Leu Thr 485 490 495 Glu Asn Glu Glu Glu Ile Lys Pro Arg Ala Val Lys Glu Asp Ile Glu 500 505 510 Leu Glu Ala Lys Asp Gln Lys Asp His Ile Gln Thr His Gln Met Lys 515 520 525 Ser Arg Gln Arg Asn Ser Pro Glu Ala Ile Leu Glu Lys Thr Ser Met 530 535 540 Lys Leu Ser Met Ala Leu Glu Lys Lys Ala Asn Ala Glu Lys Val Leu 545 550 555 560 Ala Asp Leu Glu Asn Arg Glu Ser Pro Gln Leu Ser Asp Ile Asp Lys 565 570 575 Glu Gly 
Ile Thr Asn Asp Glu Lys Tyr Met Leu Arg Lys Ile Gly Leu 580 585 590 Lys Met Lys Pro Phe Leu Leu Leu Gly Arg Arg Gly Val Phe Asp Gly 595 600 605 Thr Ile Glu Asn Met His Leu His Trp Lys Tyr Arg Glu Leu Val Lys 610 615 620 Ile Ile Cys Asn Glu Tyr Ser Ile Glu Ala Ala His Lys Val Ala Glu 625 630 635 640 Ile Leu Glu Ala Glu Ser Gly Gly Ile Leu Val Ala Val Glu Met Val 645 650 655 Ser Lys Gly Tyr Ala Ile Ile Val Tyr Arg Gly Lys Asn Tyr Glu Arg 660 665 670 Pro Gln Cys Leu Arg Pro Gln Thr Leu Leu Ser Lys Arg Glu Ala Leu 675 680 685 Lys Arg Ser Val Glu Ala Gln Arg Arg Lys Ser Leu Lys Leu His Val 690 695 700 Leu Lys Leu Ser Asn Asn Ile Glu Glu Leu Asn Arg Gln Leu Val Glu 705 710 715 720 Asp Ser Ala Thr Asn Glu Thr Trp Ser Asp Gly Glu Ser Ser Asn Met 725 730 735 Met Val Glu Glu Glu Thr Glu Asn Gln His Thr Glu Pro Glu Lys Ala 740 745 750 Arg Glu Lys Ile Glu Leu Gly Tyr Ser Ser Asp Leu Ser Val Pro Ser 755 760 765 Ser Gly Glu Glu Asn Trp Glu Asp Asp Ser Glu Gly Glu Val Asp Pro 770 775 780 Leu Thr Thr Ser Ser Gln Glu Tyr Gln Glu Asp Glu Ser Glu Ser Ala 785 790 795 800 Ser Ser Gln Arg His Glu Gly Asn Ser Leu Asp Ser Thr Ala Asn Leu 805 810 815 Ser Val Phe Ala Glu Thr Gly Ser Ala Asn Ala Ser Ser Phe His Asp 820 825 830 Arg Ser Leu Pro His Asn Ser Phe Leu Asn Ala Asn Arg Lys Leu Pro 835 840 845 Gly Ser Ser Thr Gly Ser Gly Ser Gln Ile Ser Ala Leu Arg Glu Arg 850 855 860 Lys Ser Glu Asn Asp Gly Leu Val Thr Asp Leu Ser Asn Arg Glu Arg 865 870 875 880 Leu Ile Leu Arg Lys Gln Ala Leu Lys Met Lys Lys Arg Pro Pro Phe 885 890 895 Ala Val Gly Arg Ser Asn Val Val Thr Gly Leu Ala Arg Thr Leu Lys 900 905 910 Met His Phe Gln Lys Asn Pro Leu Ala Ile Val Asn Val Lys Gly Arg 915 920 925 Ala Asn Gly Thr Ser Val Gln Glu Val Ile Ala Lys Leu Lys Glu Glu 930 935 940 Thr Gly Ala Leu Leu Val Ser Gln Glu Pro Ser Lys Val Ile Leu Tyr 945 950 955 960 Arg Gly Trp Gly Ala Glu Glu Glu Met Lys Ser Phe Tyr Pro Asn Asn 965 970 975 Asn Val Lys Ser Ser Ile Asn Leu Pro Ser Thr Arg Ser Phe Val Asp 980 985 990 Asp Pro Pro 
His Val Ser Pro Ala Leu Ile Glu Ala Ile Arg Leu Glu 995 1000 1005 Cys Gly Leu 1010 35 654 DNA Arabidopsis thaliana CDS (1)..(654) 35 atg gcg aat tca atg gct aca ctt tct aga aga ctc tat cga tct ctt 48 Met Ala Asn Ser Met Ala Thr Leu Ser Arg Arg Leu Tyr Arg Ser Leu 1 5 10 15 cta tcg aac cct aga att tct caa gct tcc atg tct ttc tgt acc aac 96 Leu Ser Asn Pro Arg Ile Ser Gln Ala Ser Met Ser Phe Cys Thr Asn 20 25 30 aac atc act tcc cca gaa gac tcc gac ttt gac gag tta gaa tcg ccc 144 Asn Ile Thr Ser Pro Glu Asp Ser Asp Phe Asp Glu Leu Glu Ser Pro 35 40 45 att gaa cca aag gcc tca gat cca gtt tct cga ttt tcc gga gaa gaa 192 Ile Glu Pro Lys Ala Ser Asp Pro Val Ser Arg Phe Ser Gly Glu Glu 50 55 60 cgc gtt atg gaa gag cgt cca ctc gaa aat ggc ctt gac tct ggc att 240 Arg Val Met Glu Glu Arg Pro Leu Glu Asn Gly Leu Asp Ser Gly Ile 65 70 75 80 ttc aag gca ata ttg gtg ggg caa gtg ggt cag ctt cct ctg cag aag 288 Phe Lys Ala Ile Leu Val Gly Gln Val Gly Gln Leu Pro Leu Gln Lys 85 90 95 aag ctt aag agt ggt aga act gtc act ctc ttc tct gta gga act ggt 336 Lys Leu Lys Ser Gly Arg Thr Val Thr Leu Phe Ser Val Gly Thr Gly 100 105 110 ggg atc agg aac aac cgg aga ccg ctt att aac gaa gat ccg aga gag 384 Gly Ile Arg Asn Asn Arg Arg Pro Leu Ile Asn Glu Asp Pro Arg Glu 115 120 125 tat gcg agc cgg tct gcg gtg cag tgg cac cgt gtc tct gtt tac cct 432 Tyr Ala Ser Arg Ser Ala Val Gln Trp His Arg Val Ser Val Tyr Pro 130 135 140 gaa cgc ttg gca gat ctt gtg ttg aag aat gtt gaa ccc ggt act gtt 480 Glu Arg Leu Ala Asp Leu Val Leu Lys Asn Val Glu Pro Gly Thr Val 145 150 155 160 atc tac tta gag ggt aat ctg gag acg aag ata ttc act gat cct gtc 528 Ile Tyr Leu Glu Gly Asn Leu Glu Thr Lys Ile Phe Thr Asp Pro Val 165 170 175 act ggt ttg gtt cga cgt ata aga gaa gta gca att cgt aga aac ggc 576 Thr Gly Leu Val Arg Arg Ile Arg Glu Val Ala Ile Arg Arg Asn Gly 180 185 190 cga gtt gtg ttt ttg gga aaa gcc ggt gat atg cag caa ccg agt tct 624 Arg Val Val Phe Leu Gly Lys Ala Gly Asp Met Gln Gln Pro Ser Ser 195 200 205 gct 
gag ctt aga ggc gtt ggc tac tac tga 654 Ala Glu Leu Arg Gly Val Gly Tyr Tyr 210 215 36 217 PRT Arabidopsis thaliana 36 Met Ala Asn Ser Met Ala Thr Leu Ser Arg Arg Leu Tyr Arg Ser Leu 1 5 10 15 Leu Ser Asn Pro Arg Ile Ser Gln Ala Ser Met Ser Phe Cys Thr Asn 20 25 30 Asn Ile Thr Ser Pro Glu Asp Ser Asp Phe Asp Glu Leu Glu Ser Pro 35 40 45 Ile Glu Pro Lys Ala Ser Asp Pro Val Ser Arg Phe Ser Gly Glu Glu 50 55 60 Arg Val Met Glu Glu Arg Pro Leu Glu Asn Gly Leu Asp Ser Gly Ile 65 70 75 80 Phe Lys Ala Ile Leu Val Gly Gln Val Gly Gln Leu Pro Leu Gln Lys 85 90 95 Lys Leu Lys Ser Gly Arg Thr Val Thr Leu Phe Ser Val Gly Thr Gly 100 105 110 Gly Ile Arg Asn Asn Arg Arg Pro Leu Ile Asn Glu Asp Pro Arg Glu 115 120 125 Tyr Ala Ser Arg Ser Ala Val Gln Trp His Arg Val Ser Val Tyr Pro 130 135 140 Glu Arg Leu Ala Asp Leu Val Leu Lys Asn Val Glu Pro Gly Thr Val 145 150 155 160 Ile Tyr Leu Glu Gly Asn Leu Glu Thr Lys Ile Phe Thr Asp Pro Val 165 170 175 Thr Gly Leu Val Arg Arg Ile Arg Glu Val Ala Ile Arg Arg Asn Gly 180 185 190 Arg Val Val Phe Leu Gly Lys Ala Gly Asp Met Gln Gln Pro Ser Ser 195 200 205 Ala Glu Leu Arg Gly Val Gly Tyr Tyr 210 215 37 1320 DNA Arabidopsis thaliana CDS (1)..(1320) 37 atg gct tca gat gct gac atg gag gac tat gga ttc gag tat tct gat 48 Met Ala Ser Asp Ala Asp Met Glu Asp Tyr Gly Phe Glu Tyr Ser Asp 1 5 10 15 gag gag caa gag gaa caa gat gtt gac att gag aac cag tat tat aac 96 Glu Glu Gln Glu Glu Gln Asp Val Asp Ile Glu Asn Gln Tyr Tyr Asn 20 25 30 tct aaa ggt atg gtt gag act gaa cct gaa gaa gca ctt tct gga ttt 144 Ser Lys Gly Met Val Glu Thr Glu Pro Glu Glu Ala Leu Ser Gly Phe 35 40 45 gct gag gtt gtt cag atg gaa cct gag aaa gct gac tgg ggt ttc aaa 192 Ala Glu Val Val Gln Met Glu Pro Glu Lys Ala Asp Trp Gly Phe Lys 50 55 60 gct ctt aag cag act gtg aag atc tat tat cgt cta ggt aaa tac aaa 240 Ala Leu Lys Gln Thr Val Lys Ile Tyr Tyr Arg Leu Gly Lys Tyr Lys 65 70 75 80 gaa atg atg gaa gcc tat aca gag atg ctt aca tat atc aag tca gca 288 Glu Met Met Glu Ala Tyr Thr Glu 
Met Leu Thr Tyr Ile Lys Ser Ala 85 90 95 gtc acc agg aat tac agc gaa aaa tgt ata aac aat atc atg gat ttc 336 Val Thr Arg Asn Tyr Ser Glu Lys Cys Ile Asn Asn Ile Met Asp Phe 100 105 110 gtc tct gga tct gct agc cag aac act ggc ctg ctg caa gag ttc tat 384 Val Ser Gly Ser Ala Ser Gln Asn Thr Gly Leu Leu Gln Glu Phe Tyr 115 120 125 cag acc act ttg aaa gcc ctt gaa gag gct aag aat gag aga ctc tgg 432 Gln Thr Thr Leu Lys Ala Leu Glu Glu Ala Lys Asn Glu Arg Leu Trp 130 135 140 ttc aag aca aat ctt aaa ctc tgc aac atc tgg ttt gac att ggt gaa 480 Phe Lys Thr Asn Leu Lys Leu Cys Asn Ile Trp Phe Asp Ile Gly Glu 145 150 155 160 tac aga cgg atg act aag att ctg aag gaa ctc cat aag tct tgt caa 528 Tyr Arg Arg Met Thr Lys Ile Leu Lys Glu Leu His Lys Ser Cys Gln 165 170 175 aag gaa gat gga act gat gat cag aag aaa gga agt cag ctg ctt gag 576 Lys Glu Asp Gly Thr Asp Asp Gln Lys Lys Gly Ser Gln Leu Leu Glu 180 185 190 gtt tat gca att gaa att cag att tac act gaa acg aag gac aac aaa 624 Val Tyr Ala Ile Glu Ile Gln Ile Tyr Thr Glu Thr Lys Asp Asn Lys 195 200 205 aag ctt aag caa cta tac cat aag gca ctt gcc atc aag tct gcc ata 672 Lys Leu Lys Gln Leu Tyr His Lys Ala Leu Ala Ile Lys Ser Ala Ile 210 215 220 cct cat cct agg atc atg ggt ata atc cgc gag tgt ggt gga aaa atg 720 Pro His Pro Arg Ile Met Gly Ile Ile Arg Glu Cys Gly Gly Lys Met 225 230 235 240 cac atg gca gaa cgt cag tgg gaa gaa gca gcg aca gac ttc ttt gag 768 His Met Ala Glu Arg Gln Trp Glu Glu Ala Ala Thr Asp Phe Phe Glu 245 250 255 gct ttt aaa aat tac gat gaa gct ggc aac caa aga cgt ata cag tgc 816 Ala Phe Lys Asn Tyr Asp Glu Ala Gly Asn Gln Arg Arg Ile Gln Cys 260 265 270 ctg aag tat ctt gtt ctg gcg aat atg ctg atg gag tca gaa gtg aat 864 Leu Lys Tyr Leu Val Leu Ala Asn Met Leu Met Glu Ser Glu Val Asn 275 280 285 ccg ttt gat ggc caa gag gca aag ccg tac aaa aac gac cct gag att 912 Pro Phe Asp Gly Gln Glu Ala Lys Pro Tyr Lys Asn Asp Pro Glu Ile 290 295 300 ttg gca atg aca aat ttg atc gca gca tat caa cga aat gag atc ata 960 Leu Ala 
Met Thr Asn Leu Ile Ala Ala Tyr Gln Arg Asn Glu Ile Ile 305 310 315 320 gag ttt gag aga ata cta aag agt aac cgg agg acg ata atg gat gat 1008 Glu Phe Glu Arg Ile Leu Lys Ser Asn Arg Arg Thr Ile Met Asp Asp 325 330 335 cca ttc atc aga aac tac atg gaa gat ctg ctg aag aaa gtg aga aca 1056 Pro Phe Ile Arg Asn Tyr Met Glu Asp Leu Leu Lys Lys Val Arg Thr 340 345 350 caa gtg tta ctg aag ctg att aag cca tac aca aag ata ggg att ccg 1104 Gln Val Leu Leu Lys Leu Ile Lys Pro Tyr Thr Lys Ile Gly Ile Pro 355 360 365 ttc ata tca aag gaa ctg aac gtg cca gag acg gat gtg acg gag tta 1152 Phe Ile Ser Lys Glu Leu Asn Val Pro Glu Thr Asp Val Thr Glu Leu 370 375 380 cta gtg tcg ctg ata ctc gac agc aga atc gat ggt cac att gat gaa 1200 Leu Val Ser Leu Ile Leu Asp Ser Arg Ile Asp Gly His Ile Asp Glu 385 390 395 400 atg aac cgt tac tta ctg aga ggc gac agt ggc aat gga agg aag ttg 1248 Met Asn Arg Tyr Leu Leu Arg Gly Asp Ser Gly Asn Gly Arg Lys Leu 405 410 415 cat aag gcg gtt gat aaa tgg aac tca cag ctc aag tct ctt tct tca 1296 His Lys Ala Val Asp Lys Trp Asn Ser Gln Leu Lys Ser Leu Ser Ser 420 425 430 aat atc acc agc cga gtc tgt taa 1320 Asn Ile Thr Ser Arg Val Cys 435 38 439 PRT Arabidopsis thaliana 38 Met Ala Ser Asp Ala Asp Met Glu Asp Tyr Gly Phe Glu Tyr Ser Asp 1 5 10 15 Glu Glu Gln Glu Glu Gln Asp Val Asp Ile Glu Asn Gln Tyr Tyr Asn 20 25 30 Ser Lys Gly Met Val Glu Thr Glu Pro Glu Glu Ala Leu Ser Gly Phe 35 40 45 Ala Glu Val Val Gln Met Glu Pro Glu Lys Ala Asp Trp Gly Phe Lys 50 55 60 Ala Leu Lys Gln Thr Val Lys Ile Tyr Tyr Arg Leu Gly Lys Tyr Lys 65 70 75 80 Glu Met Met Glu Ala Tyr Thr Glu Met Leu Thr Tyr Ile Lys Ser Ala 85 90 95 Val Thr Arg Asn Tyr Ser Glu Lys Cys Ile Asn Asn Ile Met Asp Phe 100 105 110 Val Ser Gly Ser Ala Ser Gln Asn Thr Gly Leu Leu Gln Glu Phe Tyr 115 120 125 Gln Thr Thr Leu Lys Ala Leu Glu Glu Ala Lys Asn Glu Arg Leu Trp 130 135 140 Phe Lys Thr Asn Leu Lys Leu Cys Asn Ile Trp Phe Asp Ile Gly Glu 145 150 155 160 Tyr Arg Arg Met Thr Lys Ile Leu Lys Glu Leu His Lys 
Ser Cys Gln 165 170 175 Lys Glu Asp Gly Thr Asp Asp Gln Lys Lys Gly Ser Gln Leu Leu Glu 180 185 190 Val Tyr Ala Ile Glu Ile Gln Ile Tyr Thr Glu Thr Lys Asp Asn Lys 195 200 205 Lys Leu Lys Gln Leu Tyr His Lys Ala Leu Ala Ile Lys Ser Ala Ile 210 215 220 Pro His Pro Arg Ile Met Gly Ile Ile Arg Glu Cys Gly Gly Lys Met 225 230 235 240 His Met Ala Glu Arg Gln Trp Glu Glu Ala Ala Thr Asp Phe Phe Glu 245 250 255 Ala Phe Lys Asn Tyr Asp Glu Ala Gly Asn Gln Arg Arg Ile Gln Cys 260 265 270 Leu Lys Tyr Leu Val Leu Ala Asn Met Leu Met Glu Ser Glu Val Asn 275 280 285 Pro Phe Asp Gly Gln Glu Ala Lys Pro Tyr Lys Asn Asp Pro Glu Ile 290 295 300 Leu Ala Met Thr Asn Leu Ile Ala Ala Tyr Gln Arg Asn Glu Ile Ile 305 310 315 320 Glu Phe Glu Arg Ile Leu Lys Ser Asn Arg Arg Thr Ile Met Asp Asp 325 330 335 Pro Phe Ile Arg Asn Tyr Met Glu Asp Leu Leu Lys Lys Val Arg Thr 340 345 350 Gln Val Leu Leu Lys Leu Ile Lys Pro Tyr Thr Lys Ile Gly Ile Pro 355 360 365 Phe Ile Ser Lys Glu Leu Asn Val Pro Glu Thr Asp Val Thr Glu Leu 370 375 380 Leu Val Ser Leu Ile Leu Asp Ser Arg Ile Asp Gly His Ile Asp Glu 385 390 395 400 Met Asn Arg Tyr Leu Leu Arg Gly Asp Ser Gly Asn Gly Arg Lys Leu 405 410 415 His Lys Ala Val Asp Lys Trp Asn Ser Gln Leu Lys Ser Leu Ser Ser 420 425 430 Asn Ile Thr Ser Arg Val Cys 435 39 1158 DNA Arabidopsis thaliana CDS (1)..(1158) 39 atg ggt ttc aaa ctc aac tct ctt ttc ctt ctt ctt tcg ctt ctc ata 48 Met Gly Phe Lys Leu Asn Ser Leu Phe Leu Leu Leu Ser Leu Leu Ile 1 5 10 15 ctc aca atc ttg tcc gga gcc gac caa gcc ttg gct tac gtc aag aaa 96 Leu Thr Ile Leu Ser Gly Ala Asp Gln Ala Leu Ala Tyr Val Lys Lys 20 25 30 cca cat gtg agc caa cgc aac aaa acg gcg tta gca gcc gtc gcc gga 144 Pro His Val Ser Gln Arg Asn Lys Thr Ala Leu Ala Ala Val Ala Gly 35 40 45 aga gga gga gga aaa gag atg ttg aaa ggg agg aag cag acg agt ggt 192 Arg Gly Gly Gly Lys Glu Met Leu Lys Gly Arg Lys Gln Thr Ser Gly 50 55 60 tgt aac ttg ttt caa ggg aga tgg gtt ttc gat gct tct tac cct ttc 240 Cys Asn Leu Phe Gln Gly Arg Trp 
Val Phe Asp Ala Ser Tyr Pro Phe 65 70 75 80 tac gat tca tcc acg tgc cct ttc atc gac ggt gag ttc gac tgt ctc 288 Tyr Asp Ser Ser Thr Cys Pro Phe Ile Asp Gly Glu Phe Asp Cys Leu 85 90 95 aaa ttc ggc cga cca gac aaa cag ttc ctt aag tac tct tgg cag cct 336 Lys Phe Gly Arg Pro Asp Lys Gln Phe Leu Lys Tyr Ser Trp Gln Pro 100 105 110 gat tca tgc acc gtc cca agg ttt gat ggg gaa gcg ttt ttg aag aaa 384 Asp Ser Cys Thr Val Pro Arg Phe Asp Gly Glu Ala Phe Leu Lys Lys 115 120 125 tgg aga ggg aaa cga gtg atg ttc gtg ggt gac tca ctg agt cta aac 432 Trp Arg Gly Lys Arg Val Met Phe Val Gly Asp Ser Leu Ser Leu Asn 130 135 140 atg tgg gaa tcg ttg gca tgt atg ata cat tcg tcg gtt cca aac act 480 Met Trp Glu Ser Leu Ala Cys Met Ile His Ser Ser Val Pro Asn Thr 145 150 155 160 aag acc act ttt ctc aag cga acc cca ctc tcc tct ctc act ttc cag 528 Lys Thr Thr Phe Leu Lys Arg Thr Pro Leu Ser Ser Leu Thr Phe Gln 165 170 175 gaa tat gac gtc aca cta ttc cta tac cga aca cca tac cta gtg gac 576 Glu Tyr Asp Val Thr Leu Phe Leu Tyr Arg Thr Pro Tyr Leu Val Asp 180 185 190 atc tcc aaa gaa agt gtc ggg cgt gtg ctt aac ctt gga gcc att gaa 624 Ile Ser Lys Glu Ser Val Gly Arg Val Leu Asn Leu Gly Ala Ile Glu 195 200 205 gat gga gct gat gct tgg aaa aat atg gac ctc ctc gtc ttc aat tct 672 Asp Gly Ala Asp Ala Trp Lys Asn Met Asp Leu Leu Val Phe Asn Ser 210 215 220 tgg cac tgg tgg act cac aca gga gta cag tct caa ggg tgg gat ttt 720 Trp His Trp Trp Thr His Thr Gly Val Gln Ser Gln Gly Trp Asp Phe 225 230 235 240 ata aga gat ggg tct tca ctg atg aga gac atg gac cgc ctt gat gct 768 Ile Arg Asp Gly Ser Ser Leu Met Arg Asp Met Asp Arg Leu Asp Ala 245 250 255 ttc aac aaa gga ctc acc act tgg ggt caa tgg gtt gat caa aat gtt 816 Phe Asn Lys Gly Leu Thr Thr Trp Gly Gln Trp Val Asp Gln Asn Val 260 265 270 aat gtt tcg caa acc cga gtc ttc ttc caa ggc att tct ccc act cac 864 Asn Val Ser Gln Thr Arg Val Phe Phe Gln Gly Ile Ser Pro Thr His 275 280 285 tac atg gga agg gaa tgg aac gag ccg agg aaa act tgc aac ggg cag 912 Tyr Met 
Gly Arg Glu Trp Asn Glu Pro Arg Lys Thr Cys Asn Gly Gln 290 295 300 atg caa ccg ttg acc gga tca aca tac cca ggt ggt tca ctt cct gca 960 Met Gln Pro Leu Thr Gly Ser Thr Tyr Pro Gly Gly Ser Leu Pro Ala 305 310 315 320 gca agc att gtg tct cga gta tta agc acg atg aga acg ccc gtt tac 1008 Ala Ser Ile Val Ser Arg Val Leu Ser Thr Met Arg Thr Pro Val Tyr 325 330 335 tta ctc gac atc aca act cta tct caa ctg aga aaa gat gct cat cca 1056 Leu Leu Asp Ile Thr Thr Leu Ser Gln Leu Arg Lys Asp Ala His Pro 340 345 350 tct aca tat gga ggc gat ggc gga acg gat tgc agt cac tgg tgc ctt 1104 Ser Thr Tyr Gly Gly Asp Gly Gly Thr Asp Cys Ser His Trp Cys Leu 355 360 365 cct ggc ttg ccg gat act tgg aac cag ctt ctc tat gca gct ctt tcg 1152 Pro Gly Leu Pro Asp Thr Trp Asn Gln Leu Leu Tyr Ala Ala Leu Ser 370 375 380 atg tga 1158 Met 385 40 385 PRT Arabidopsis thaliana 40 Met Gly Phe Lys Leu Asn Ser Leu Phe Leu Leu Leu Ser Leu Leu Ile 1 5 10 15 Leu Thr Ile Leu Ser Gly Ala Asp Gln Ala Leu Ala Tyr Val Lys Lys 20 25 30 Pro His Val Ser Gln Arg Asn Lys Thr Ala Leu Ala Ala Val Ala Gly 35 40 45 Arg Gly Gly Gly Lys Glu Met Leu Lys Gly Arg Lys Gln Thr Ser Gly 50 55 60 Cys Asn Leu Phe Gln Gly Arg Trp Val Phe Asp Ala Ser Tyr Pro Phe 65 70 75 80 Tyr Asp Ser Ser Thr Cys Pro Phe Ile Asp Gly Glu Phe Asp Cys Leu 85 90 95 Lys Phe Gly Arg Pro Asp Lys Gln Phe Leu Lys Tyr Ser Trp Gln Pro 100 105 110 Asp Ser Cys Thr Val Pro Arg Phe Asp Gly Glu Ala Phe Leu Lys Lys 115 120 125 Trp Arg Gly Lys Arg Val Met Phe Val Gly Asp Ser Leu Ser Leu Asn 130 135 140 Met Trp Glu Ser Leu Ala Cys Met Ile His Ser Ser Val Pro Asn Thr 145 150 155 160 Lys Thr Thr Phe Leu Lys Arg Thr Pro Leu Ser Ser Leu Thr Phe Gln 165 170 175 Glu Tyr Asp Val Thr Leu Phe Leu Tyr Arg Thr Pro Tyr Leu Val Asp 180 185 190 Ile Ser Lys Glu Ser Val Gly Arg Val Leu Asn Leu Gly Ala Ile Glu 195 200 205 Asp Gly Ala Asp Ala Trp Lys Asn Met Asp Leu Leu Val Phe Asn Ser 210 215 220 Trp His Trp Trp Thr His Thr Gly Val Gln Ser Gln Gly Trp Asp Phe 225 230 235 240 Ile Arg Asp 
Gly Ser Ser Leu Met Arg Asp Met Asp Arg Leu Asp Ala 245 250 255 Phe Asn Lys Gly Leu Thr Thr Trp Gly Gln Trp Val Asp Gln Asn Val 260 265 270 Asn Val Ser Gln Thr Arg Val Phe Phe Gln Gly Ile Ser Pro Thr His 275 280 285 Tyr Met Gly Arg Glu Trp Asn Glu Pro Arg Lys Thr Cys Asn Gly Gln 290 295 300 Met Gln Pro Leu Thr Gly Ser Thr Tyr Pro Gly Gly Ser Leu Pro Ala 305 310 315 320 Ala Ser Ile Val Ser Arg Val Leu Ser Thr Met Arg Thr Pro Val Tyr 325 330 335 Leu Leu Asp Ile Thr Thr Leu Ser Gln Leu Arg Lys Asp Ala His Pro 340 345 350 Ser Thr Tyr Gly Gly Asp Gly Gly Thr Asp Cys Ser His Trp Cys Leu 355 360 365 Pro Gly Leu Pro Asp Thr Trp Asn Gln Leu Leu Tyr Ala Ala Leu Ser 370 375 380 Met 385 41 1194 DNA Arabidopsis thaliana CDS (1)..(1194) 41 atg tta cag tcc att cat ctt cgt ttt tcc tcc aca cca tca cct tct 48 Met Leu Gln Ser Ile His Leu Arg Phe Ser Ser Thr Pro Ser Pro Ser 1 5 10 15 aaa aga gaa tct ctc ata att cca tcg gtt att tgc tca ttt cct ttc 96 Lys Arg Glu Ser Leu Ile Ile Pro Ser Val Ile Cys Ser Phe Pro Phe 20 25 30 acc tct tct tcg ttc cgt cca aag caa acc cag aaa ctg aag cgt ctg 144 Thr Ser Ser Ser Phe Arg Pro Lys Gln Thr Gln Lys Leu Lys Arg Leu 35 40 45 gtt caa ttt tgc gct cct tac gag gtc gga ggt gga tac acc gat gaa 192 Val Gln Phe Cys Ala Pro Tyr Glu Val Gly Gly Gly Tyr Thr Asp Glu 50 55 60 gaa ttg ttc gaa aga tac gga act cag caa aat caa act aat gtc aaa 240 Glu Leu Phe Glu Arg Tyr Gly Thr Gln Gln Asn Gln Thr Asn Val Lys 65 70 75 80 gat aaa tta gat cca gct gag tat gaa gct ttg ctt aaa gga ggc gaa 288 Asp Lys Leu Asp Pro Ala Glu Tyr Glu Ala Leu Leu Lys Gly Gly Glu 85 90 95 caa gtg aca tcc gtt ctt gaa gaa atg att acc ctc ttg gaa gat atg 336 Gln Val Thr Ser Val Leu Glu Glu Met Ile Thr Leu Leu Glu Asp Met 100 105 110 aag atg aat gaa gca tct gag aat gtt gct gta gaa ttg gct gca caa 384 Lys Met Asn Glu Ala Ser Glu Asn Val Ala Val Glu Leu Ala Ala Gln 115 120 125 gga gtt ata ggg aaa agg gtc gat gaa atg gaa tca ggg ttt atg atg 432 Gly Val Ile Gly Lys Arg Val Asp Glu Met Glu Ser Gly Phe Met 
Met 130 135 140 gct ctt gat tac atg atc caa ctt gca gac aaa gac caa gac gag aag 480 Ala Leu Asp Tyr Met Ile Gln Leu Ala Asp Lys Asp Gln Asp Glu Lys 145 150 155 160 aga aag tct tta cta gag gtc gtc aag gag act gtc tta tct cat ctc 528 Arg Lys Ser Leu Leu Glu Val Val Lys Glu Thr Val Leu Ser His Leu 165 170 175 acc aag aaa tgc cct cct cat gtc cag gtg att ggt tta ctc tgt aga 576 Thr Lys Lys Cys Pro Pro His Val Gln Val Ile Gly Leu Leu Cys Arg 180 185 190 acc ccg aaa aag gaa agt aga cat gag ctt ctg cgt agg gtg gct gca 624 Thr Pro Lys Lys Glu Ser Arg His Glu Leu Leu Arg Arg Val Ala Ala 195 200 205 ggt ggt ggg gct ttt gaa agt gag aac ggt act aaa ctt cat ata ccc 672 Gly Gly Gly Ala Phe Glu Ser Glu Asn Gly Thr Lys Leu His Ile Pro 210 215 220 gga gca aat ctg aat gac ata gct aat caa gct gat gac ttg cta gag 720 Gly Ala Asn Leu Asn Asp Ile Ala Asn Gln Ala Asp Asp Leu Leu Glu 225 230 235 240 act atg gaa aca agg cca gct att ccg gat cga aaa cta cta gcg agg 768 Thr Met Glu Thr Arg Pro Ala Ile Pro Asp Arg Lys Leu Leu Ala Arg 245 250 255 ctt gtt ttg att aga gag gaa gcc cgg aac atg atg gga gga ggt ata 816 Leu Val Leu Ile Arg Glu Glu Ala Arg Asn Met Met Gly Gly Gly Ile 260 265 270 ctt gat gaa aga aat gac cga ggt ttc act act ctt cct gaa tca gag 864 Leu Asp Glu Arg Asn Asp Arg Gly Phe Thr Thr Leu Pro Glu Ser Glu 275 280 285 gtg aat ttc tta gcc aaa ttg gta gct ctg aaa cct gga aag act gtg 912 Val Asn Phe Leu Ala Lys Leu Val Ala Leu Lys Pro Gly Lys Thr Val 290 295 300 cag cag atg atc cag aat gta atg caa ggg aaa gat gaa ggc gca gat 960 Gln Gln Met Ile Gln Asn Val Met Gln Gly Lys Asp Glu Gly Ala Asp 305 310 315 320 aat ctt agc aaa gaa tcc gat tct tct acc gaa gga aga aaa cca agt 1008 Asn Leu Ser Lys Glu Ser Asp Ser Ser Thr Glu Gly Arg Lys Pro Ser 325 330 335 gga tta aaa gga agg gga agc gtt aca gga aga aaa ccg tta cca gta 1056 Gly Leu Lys Gly Arg Gly Ser Val Thr Gly Arg Lys Pro Leu Pro Val 340 345 350 aga cca gga atg ttt cta gaa act gtc aca aag gta ctg gga agt ata 1104 Arg Pro Gly Met Phe Leu Glu 
Thr Val Thr Lys Val Leu Gly Ser Ile 355 360 365 tac tcg ggt aat gcc tcc ggg ata aca gca caa cat cta gaa tgg gtt 1152 Tyr Ser Gly Asn Ala Ser Gly Ile Thr Ala Gln His Leu Glu Trp Val 370 375 380 cac caa aaa acc ctc caa gtt ctt gaa gaa atc gct tat tag 1194 His Gln Lys Thr Leu Gln Val Leu Glu Glu Ile Ala Tyr 385 390 395 42 397 PRT Arabidopsis thaliana 42 Met Leu Gln Ser Ile His Leu Arg Phe Ser Ser Thr Pro Ser Pro Ser 1 5 10 15 Lys Arg Glu Ser Leu Ile Ile Pro Ser Val Ile Cys Ser Phe Pro Phe 20 25 30 Thr Ser Ser Ser Phe Arg Pro Lys Gln Thr Gln Lys Leu Lys Arg Leu 35 40 45 Val Gln Phe Cys Ala Pro Tyr Glu Val Gly Gly Gly Tyr Thr Asp Glu 50 55 60 Glu Leu Phe Glu Arg Tyr Gly Thr Gln Gln Asn Gln Thr Asn Val Lys 65 70 75 80 Asp Lys Leu Asp Pro Ala Glu Tyr Glu Ala Leu Leu Lys Gly Gly Glu 85 90 95 Gln Val Thr Ser Val Leu Glu Glu Met Ile Thr Leu Leu Glu Asp Met 100 105 110 Lys Met Asn Glu Ala Ser Glu Asn Val Ala Val Glu Leu Ala Ala Gln 115 120 125 Gly Val Ile Gly Lys Arg Val Asp Glu Met Glu Ser Gly Phe Met Met 130 135 140 Ala Leu Asp Tyr Met Ile Gln Leu Ala Asp Lys Asp Gln Asp Glu Lys 145 150 155 160 Arg Lys Ser Leu Leu Glu Val Val Lys Glu Thr Val Leu Ser His Leu 165 170 175 Thr Lys Lys Cys Pro Pro His Val Gln Val Ile Gly Leu Leu Cys Arg 180 185 190 Thr Pro Lys Lys Glu Ser Arg His Glu Leu Leu Arg Arg Val Ala Ala 195 200 205 Gly Gly Gly Ala Phe Glu Ser Glu Asn Gly Thr Lys Leu His Ile Pro 210 215 220 Gly Ala Asn Leu Asn Asp Ile Ala Asn Gln Ala Asp Asp Leu Leu Glu 225 230 235 240 Thr Met Glu Thr Arg Pro Ala Ile Pro Asp Arg Lys Leu Leu Ala Arg 245 250 255 Leu Val Leu Ile Arg Glu Glu Ala Arg Asn Met Met Gly Gly Gly Ile 260 265 270 Leu Asp Glu Arg Asn Asp Arg Gly Phe Thr Thr Leu Pro Glu Ser Glu 275 280 285 Val Asn Phe Leu Ala Lys Leu Val Ala Leu Lys Pro Gly Lys Thr Val 290 295 300 Gln Gln Met Ile Gln Asn Val Met Gln Gly Lys Asp Glu Gly Ala Asp 305 310 315 320 Asn Leu Ser Lys Glu Ser Asp Ser Ser Thr Glu Gly Arg Lys Pro Ser 325 330 335 Gly Leu Lys Gly Arg Gly Ser Val Thr Gly Arg Lys Pro 
Leu Pro Val 340 345 350 Arg Pro Gly Met Phe Leu Glu Thr Val Thr Lys Val Leu Gly Ser Ile 355 360 365 Tyr Ser Gly Asn Ala Ser Gly Ile Thr Ala Gln His Leu Glu Trp Val 370 375 380 His Gln Lys Thr Leu Gln Val Leu Glu Glu Ile Ala Tyr 385 390 395 43 2451 DNA Arabidopsis thaliana CDS (1)..(2451) 43 atg ttg aga caa att cca acc tgc aaa ggt ggg cgc ttt aca cga gtt 48 Met Leu Arg Gln Ile Pro Thr Cys Lys Gly Gly Arg Phe Thr Arg Val 1 5 10 15 ttg caa tct atc gga tca gtt att aga gaa ttt tca gca tct gca aat 96 Leu Gln Ser Ile Gly Ser Val Ile Arg Glu Phe Ser Ala Ser Ala Asn 20 25 30 gct ttg caa gac tgt tgg aaa aat ggg aat gaa agt aaa gag att gat 144 Ala Leu Gln Asp Cys Trp Lys Asn Gly Asn Glu Ser Lys Glu Ile Asp 35 40 45 gat gtt cac aca ctt ttc aga tat tgc act aat ttg caa tca gcc aaa 192 Asp Val His Thr Leu Phe Arg Tyr Cys Thr Asn Leu Gln Ser Ala Lys 50 55 60 tgt ctc cac gca cgt ctc gtt gtc tcc aaa caa atc caa aat gtt tgt 240 Cys Leu His Ala Arg Leu Val Val Ser Lys Gln Ile Gln Asn Val Cys 65 70 75 80 atc tct gct aaa ctt gtt aat ctc tat tgt tac ctt ggc aat gtc gca 288 Ile Ser Ala Lys Leu Val Asn Leu Tyr Cys Tyr Leu Gly Asn Val Ala 85 90 95 ttg gct agg cac acg ttt gat cat atc cag aac aga gat gta tac gct 336 Leu Ala Arg His Thr Phe Asp His Ile Gln Asn Arg Asp Val Tyr Ala 100 105 110 tgg aat ttg atg att tct ggt tat ggc cgt gcg gga aat tct tca gaa 384 Trp Asn Leu Met Ile Ser Gly Tyr Gly Arg Ala Gly Asn Ser Ser Glu 115 120 125 gtg att agg tgt ttt agt ctc ttt atg ttg agt tct gga ctt aca cct 432 Val Ile Arg Cys Phe Ser Leu Phe Met Leu Ser Ser Gly Leu Thr Pro 130 135 140 gat tac agg act ttt cct tcg gtt ttg aaa gct tgt agg act gtg att 480 Asp Tyr Arg Thr Phe Pro Ser Val Leu Lys Ala Cys Arg Thr Val Ile 145 150 155 160 gat ggg aat aag att cat tgc ttg gct ttg aag ttt ggg ttt atg tgg 528 Asp Gly Asn Lys Ile His Cys Leu Ala Leu Lys Phe Gly Phe Met Trp 165 170 175 gat gtc tat gtt gct gcc tcg ttg att cat ttg tac tct cga tat aaa 576 Asp Val Tyr Val Ala Ala Ser Leu Ile His Leu Tyr Ser Arg Tyr Lys 180 
185 190 gct gtg ggg aat gca cgt ata ctg ttt gat gaa atg cct gta aga gat 624 Ala Val Gly Asn Ala Arg Ile Leu Phe Asp Glu Met Pro Val Arg Asp 195 200 205 atg ggt tct tgg aat gca atg att tct ggg tat tgt caa agc gga aat 672 Met Gly Ser Trp Asn Ala Met Ile Ser Gly Tyr Cys Gln Ser Gly Asn 210 215 220 gct aaa gag gcg tta act ctg tct aat ggg ttg aga gct atg gat tca 720 Ala Lys Glu Ala Leu Thr Leu Ser Asn Gly Leu Arg Ala Met Asp Ser 225 230 235 240 gtt acc gtt gtg agt ctc ctt tcg gct tgc act gaa gcc ggt gat ttc 768 Val Thr Val Val Ser Leu Leu Ser Ala Cys Thr Glu Ala Gly Asp Phe 245 250 255 aat aga gga gtt act att cat tca tac tcg ata aaa cat ggt ctg gag 816 Asn Arg Gly Val Thr Ile His Ser Tyr Ser Ile Lys His Gly Leu Glu 260 265 270 tcc gaa ttg ttt gtt tcc aac aag ctg att gat ttg tat gct gaa ttt 864 Ser Glu Leu Phe Val Ser Asn Lys Leu Ile Asp Leu Tyr Ala Glu Phe 275 280 285 ggt agg ctt agg gat tgc cag aag gtt ttt gat agg atg tat gtg agg 912 Gly Arg Leu Arg Asp Cys Gln Lys Val Phe Asp Arg Met Tyr Val Arg 290 295 300 gat ttg att tct tgg aat tcg ata att aag gcg tac gag ctg aat gag 960 Asp Leu Ile Ser Trp Asn Ser Ile Ile Lys Ala Tyr Glu Leu Asn Glu 305 310 315 320 cag ccg ctt aga gcg att tcg ttg ttt cag gag atg cgg ttg agt aga 1008 Gln Pro Leu Arg Ala Ile Ser Leu Phe Gln Glu Met Arg Leu Ser Arg 325 330 335 att caa cca gat tgc ctt act ttg att agt ttg gct tct ata cta tca 1056 Ile Gln Pro Asp Cys Leu Thr Leu Ile Ser Leu Ala Ser Ile Leu Ser 340 345 350 cag tta ggg gac att cgt gct tgt aga tct gtg cag ggt ttc act ttg 1104 Gln Leu Gly Asp Ile Arg Ala Cys Arg Ser Val Gln Gly Phe Thr Leu 355 360 365 agg aaa ggc tgg ttc ttg gaa gat att acc atc ggg aat gct gtt gtg 1152 Arg Lys Gly Trp Phe Leu Glu Asp Ile Thr Ile Gly Asn Ala Val Val 370 375 380 gtt atg tat gca aaa cta ggc ctt gta gat tca gca aga gca gtg ttt 1200 Val Met Tyr Ala Lys Leu Gly Leu Val Asp Ser Ala Arg Ala Val Phe 385 390 395 400 aat tgg ctt ccg aat aca gat gtt att tca tgg aat acc ata atc tcg 1248 Asn Trp Leu Pro Asn Thr Asp Val 
Ile Ser Trp Asn Thr Ile Ile Ser 405 410 415 ggt tat gct caa aac ggt ttt gct agt gaa gca att gaa atg tat aac 1296 Gly Tyr Ala Gln Asn Gly Phe Ala Ser Glu Ala Ile Glu Met Tyr Asn 420 425 430 ata atg gag gaa gaa ggt gaa ata gcc gct aac caa ggg act tgg gtg 1344 Ile Met Glu Glu Glu Gly Glu Ile Ala Ala Asn Gln Gly Thr Trp Val 435 440 445 agt gtt ttg ccc gct tgt tcg cag gct gga gcg ttg cgt caa ggt atg 1392 Ser Val Leu Pro Ala Cys Ser Gln Ala Gly Ala Leu Arg Gln Gly Met 450 455 460 aaa ctc cat ggc cgt ctc ttg aag aac ggt ctt tat ttg gat gtt ttt 1440 Lys Leu His Gly Arg Leu Leu Lys Asn Gly Leu Tyr Leu Asp Val Phe 465 470 475 480 gtt gtt aca agc ctt gct gat atg tat gga aag tgt gga cga cta gag 1488 Val Val Thr Ser Leu Ala Asp Met Tyr Gly Lys Cys Gly Arg Leu Glu 485 490 495 gat gcc ctc tct ttg ttt tat caa atc cca aga gtc aac tct gtg cca 1536 Asp Ala Leu Ser Leu Phe Tyr Gln Ile Pro Arg Val Asn Ser Val Pro 500 505 510 tgg aac acc cta ata gcg tgt cac ggg ttt cac ggg cat ggc gag aaa 1584 Trp Asn Thr Leu Ile Ala Cys His Gly Phe His Gly His Gly Glu Lys 515 520 525 gca gtg atg ctg ttc aaa gaa atg ctg gat gaa gga gtg aag cca gat 1632 Ala Val Met Leu Phe Lys Glu Met Leu Asp Glu Gly Val Lys Pro Asp 530 535 540 cac att aca ttt gtg act ctg tta tcg gct tgt agc cat tct ggg tta 1680 His Ile Thr Phe Val Thr Leu Leu Ser Ala Cys Ser His Ser Gly Leu 545 550 555 560 gta gac gag ggt caa tgg tgc ttt gag atg atg caa acg gat tat ggg 1728 Val Asp Glu Gly Gln Trp Cys Phe Glu Met Met Gln Thr Asp Tyr Gly 565 570 575 ata aca ccg agc ttg aaa cat tac ggt tgt atg gta gat atg tat ggt 1776 Ile Thr Pro Ser Leu Lys His Tyr Gly Cys Met Val Asp Met Tyr Gly 580 585 590 cga gct ggt cag tta gaa acc gcg ttg aag ttt ata aaa agt atg tcg 1824 Arg Ala Gly Gln Leu Glu Thr Ala Leu Lys Phe Ile Lys Ser Met Ser 595 600 605 ctt cag cct gat gca tcc att tgg gga gct ctt ctc agc gct tgt aga 1872 Leu Gln Pro Asp Ala Ser Ile Trp Gly Ala Leu Leu Ser Ala Cys Arg 610 615 620 gtc cat ggg aat gtt gat ctt ggc aag atc gca tct gaa cat ttg 
ttc 1920 Val His Gly Asn Val Asp Leu Gly Lys Ile Ala Ser Glu His Leu Phe 625 630 635 640 gag gtt gaa cct gag cat gtc ggg tat cat gtt cta ttg tca aac atg 1968 Glu Val Glu Pro Glu His Val Gly Tyr His Val Leu Leu Ser Asn Met 645 650 655 tat gct agt gct gga aaa tgg gaa gga gtt gat gaa ata cgt tca ata 2016 Tyr Ala Ser Ala Gly Lys Trp Glu Gly Val Asp Glu Ile Arg Ser Ile 660 665 670 gcc cat ggc aaa ggt ttg agg aag aca ccg ggt tgg agc tca atg gaa 2064 Ala His Gly Lys Gly Leu Arg Lys Thr Pro Gly Trp Ser Ser Met Glu 675 680 685 gta gac aac aaa gtc gaa gtc ttt tac acg gga aac caa acg cat cca 2112 Val Asp Asn Lys Val Glu Val Phe Tyr Thr Gly Asn Gln Thr His Pro 690 695 700 atg tat gaa gag atg tac aga gaa tta acg gca tta caa gcg aaa ctg 2160 Met Tyr Glu Glu Met Tyr Arg Glu Leu Thr Ala Leu Gln Ala Lys Leu 705 710 715 720 aag atg att ggg tat gta ccg gac cat aga ttt gtg ttg cag gat gtt 2208 Lys Met Ile Gly Tyr Val Pro Asp His Arg Phe Val Leu Gln Asp Val 725 730 735 gaa gac gac gag aaa gag cat atc ctc atg agt cat agt gaa aga tta 2256 Glu Asp Asp Glu Lys Glu His Ile Leu Met Ser His Ser Glu Arg Leu 740 745 750 gct att gca ttt gca ctt atc gcc act cct gca aag aca acg att cgg 2304 Ala Ile Ala Phe Ala Leu Ile Ala Thr Pro Ala Lys Thr Thr Ile Arg 755 760 765 att ttt aag aat cta agg gta tgc ggc gat tgc cat agc gta acc aag 2352 Ile Phe Lys Asn Leu Arg Val Cys Gly Asp Cys His Ser Val Thr Lys 770 775 780 ttc ata tcg aaa atc act gag aga gag atc att gtg agg gat tca aac 2400 Phe Ile Ser Lys Ile Thr Glu Arg Glu Ile Ile Val Arg Asp Ser Asn 785 790 795 800 cgg ttt cat cat ttc aag aat gga gtc tgc tct tgc ggt gat tac tgg 2448 Arg Phe His His Phe Lys Asn Gly Val Cys Ser Cys Gly Asp Tyr Trp 805 810 815 tga 2451 44 816 PRT Arabidopsis thaliana 44 Met Leu Arg Gln Ile Pro Thr Cys Lys Gly Gly Arg Phe Thr Arg Val 1 5 10 15 Leu Gln Ser Ile Gly Ser Val Ile Arg Glu Phe Ser Ala Ser Ala Asn 20 25 30 Ala Leu Gln Asp Cys Trp Lys Asn Gly Asn Glu Ser Lys Glu Ile Asp 35 40 45 Asp Val His Thr Leu Phe Arg Tyr Cys Thr Asn 
Leu Gln Ser Ala Lys 50 55 60 Cys Leu His Ala Arg Leu Val Val Ser Lys Gln Ile Gln Asn Val Cys 65 70 75 80 Ile Ser Ala Lys Leu Val Asn Leu Tyr Cys Tyr Leu Gly Asn Val Ala 85 90 95 Leu Ala Arg His Thr Phe Asp His Ile Gln Asn Arg Asp Val Tyr Ala 100 105 110 Trp Asn Leu Met Ile Ser Gly Tyr Gly Arg Ala Gly Asn Ser Ser Glu 115 120 125 Val Ile Arg Cys Phe Ser Leu Phe Met Leu Ser Ser Gly Leu Thr Pro 130 135 140 Asp Tyr Arg Thr Phe Pro Ser Val Leu Lys Ala Cys Arg Thr Val Ile 145 150 155 160 Asp Gly Asn Lys Ile His Cys Leu Ala Leu Lys Phe Gly Phe Met Trp 165 170 175 Asp Val Tyr Val Ala Ala Ser Leu Ile His Leu Tyr Ser Arg Tyr Lys 180 185 190 Ala Val Gly Asn Ala Arg Ile Leu Phe Asp Glu Met Pro Val Arg Asp 195 200 205 Met Gly Ser Trp Asn Ala Met Ile Ser Gly Tyr Cys Gln Ser Gly Asn 210 215 220 Ala Lys Glu Ala Leu Thr Leu Ser Asn Gly Leu Arg Ala Met Asp Ser 225 230 235 240 Val Thr Val Val Ser Leu Leu Ser Ala Cys Thr Glu Ala Gly Asp Phe 245 250 255 Asn Arg Gly Val Thr Ile His Ser Tyr Ser Ile Lys His Gly Leu Glu 260 265 270 Ser Glu Leu Phe Val Ser Asn Lys Leu Ile Asp Leu Tyr Ala Glu Phe 275 280 285 Gly Arg Leu Arg Asp Cys Gln Lys Val Phe Asp Arg Met Tyr Val Arg 290 295 300 Asp Leu Ile Ser Trp Asn Ser Ile Ile Lys Ala Tyr Glu Leu Asn Glu 305 310 315 320 Gln Pro Leu Arg Ala Ile Ser Leu Phe Gln Glu Met Arg Leu Ser Arg 325 330 335 Ile Gln Pro Asp Cys Leu Thr Leu Ile Ser Leu Ala Ser Ile Leu Ser 340 345 350 Gln Leu Gly Asp Ile Arg Ala Cys Arg Ser Val Gln Gly Phe Thr Leu 355 360 365 Arg Lys Gly Trp Phe Leu Glu Asp Ile Thr Ile Gly Asn Ala Val Val 370 375 380 Val Met Tyr Ala Lys Leu Gly Leu Val Asp Ser Ala Arg Ala Val Phe 385 390 395 400 Asn Trp Leu Pro Asn Thr Asp Val Ile Ser Trp Asn Thr Ile Ile Ser 405 410 415 Gly Tyr Ala Gln Asn Gly Phe Ala Ser Glu Ala Ile Glu Met Tyr Asn 420 425 430 Ile Met Glu Glu Glu Gly Glu Ile Ala Ala Asn Gln Gly Thr Trp Val 435 440 445 Ser Val Leu Pro Ala Cys Ser Gln Ala Gly Ala Leu Arg Gln Gly Met 450 455 460 Lys Leu His Gly Arg Leu Leu Lys Asn Gly Leu Tyr Leu Asp 
Val Phe 465 470 475 480 Val Val Thr Ser Leu Ala Asp Met Tyr Gly Lys Cys Gly Arg Leu Glu 485 490 495 Asp Ala Leu Ser Leu Phe Tyr Gln Ile Pro Arg Val Asn Ser Val Pro 500 505 510 Trp Asn Thr Leu Ile Ala Cys His Gly Phe His Gly His Gly Glu Lys 515 520 525 Ala Val Met Leu Phe Lys Glu Met Leu Asp Glu Gly Val Lys Pro Asp 530 535 540 His Ile Thr Phe Val Thr Leu Leu Ser Ala Cys Ser His Ser Gly Leu 545 550 555 560 Val Asp Glu Gly Gln Trp Cys Phe Glu Met Met Gln Thr Asp Tyr Gly 565 570 575 Ile Thr Pro Ser Leu Lys His Tyr Gly Cys Met Val Asp Met Tyr Gly 580 585 590 Arg Ala Gly Gln Leu Glu Thr Ala Leu Lys Phe Ile Lys Ser Met Ser 595 600 605 Leu Gln Pro Asp Ala Ser Ile Trp Gly Ala Leu Leu Ser Ala Cys Arg 610 615 620 Val His Gly Asn Val Asp Leu Gly Lys Ile Ala Ser Glu His Leu Phe 625 630 635 640 Glu Val Glu Pro Glu His Val Gly Tyr His Val Leu Leu Ser Asn Met 645 650 655 Tyr Ala Ser Ala Gly Lys Trp Glu Gly Val Asp Glu Ile Arg Ser Ile 660 665 670 Ala His Gly Lys Gly Leu Arg Lys Thr Pro Gly Trp Ser Ser Met Glu 675 680 685 Val Asp Asn Lys Val Glu Val Phe Tyr Thr Gly Asn Gln Thr His Pro 690 695 700 Met Tyr Glu Glu Met Tyr Arg Glu Leu Thr Ala Leu Gln Ala Lys Leu 705 710 715 720 Lys Met Ile Gly Tyr Val Pro Asp His Arg Phe Val Leu Gln Asp Val 725 730 735 Glu Asp Asp Glu Lys Glu His Ile Leu Met Ser His Ser Glu Arg Leu 740 745 750 Ala Ile Ala Phe Ala Leu Ile Ala Thr Pro Ala Lys Thr Thr Ile Arg 755 760 765 Ile Phe Lys Asn Leu Arg Val Cys Gly Asp Cys His Ser Val Thr Lys 770 775 780 Phe Ile Ser Lys Ile Thr Glu Arg Glu Ile Ile Val Arg Asp Ser Asn 785 790 795 800 Arg Phe His His Phe Lys Asn Gly Val Cys Ser Cys Gly Asp Tyr Trp 805 810 815 45 3156 DNA Arabidopsis thaliana CDS (1)..(3156) 45 atg ggc gaa gaa tca gaa atc gcc atg gct tca gca gat cga aat aac 48 Met Gly Glu Glu Ser Glu Ile Ala Met Ala Ser Ala Asp Arg Asn Asn 1 5 10 15 cta acc cag aaa atc gcc aag atc cta aac gag act cgc aca tca tac 96 Leu Thr Gln Lys Ile Ala Lys Ile Leu Asn Glu Thr Arg Thr Ser Tyr 20 25 30 gct act cat aac cgt aag cta 
aag gag ctc gcc aca atc cgt tca aaa 144 Ala Thr His Asn Arg Lys Leu Lys Glu Leu Ala Thr Ile Arg Ser Lys 35 40 45 cta tct tca tct gaa tcc gaa tcc gag tcc gtg tct tct tct att ctt 192 Leu Ser Ser Ser Glu Ser Glu Ser Glu Ser Val Ser Ser Ser Ile Leu 50 55 60 caa ttc tct tcc gtg ttt ctc aaa acc cta acc cct ctt ttc atc gcg 240 Gln Phe Ser Ser Val Phe Leu Lys Thr Leu Thr Pro Leu Phe Ile Ala 65 70 75 80 gct caa cga aga acc gct gca gca gag cgt gtt gtt cga ttc gtg gcg 288 Ala Gln Arg Arg Thr Ala Ala Ala Glu Arg Val Val Arg Phe Val Ala 85 90 95 gaa ttc gct tgt ctt cgt agt aat tca gac ggt gac tcc gat tgc gac 336 Glu Phe Ala Cys Leu Arg Ser Asn Ser Asp Gly Asp Ser Asp Cys Asp 100 105 110 gag ttt cta gag gaa ttt ctt aag ttt ttg gtt gct ggg tct gtt gct 384 Glu Phe Leu Glu Glu Phe Leu Lys Phe Leu Val Ala Gly Ser Val Ala 115 120 125 gcc aat aga aac gcg aga ttc agg gcg tgt cag ata att tca gag atc 432 Ala Asn Arg Asn Ala Arg Phe Arg Ala Cys Gln Ile Ile Ser Glu Ile 130 135 140 ata ttg agg ctg cca gat gaa gtg gaa gta gcg gat gaa cta tgg gat 480 Ile Leu Arg Leu Pro Asp Glu Val Glu Val Ala Asp Glu Leu Trp Asp 145 150 155 160 gat gta att gat tgt atg atg ttg cga gtt cgg gac aaa gtt cct gtt 528 Asp Val Ile Asp Cys Met Met Leu Arg Val Arg Asp Lys Val Pro Val 165 170 175 ata cgt aca ttt gct gtg agg tct ctt tca cgt ttt gtg aat gat cct 576 Ile Arg Thr Phe Ala Val Arg Ser Leu Ser Arg Phe Val Asn Asp Pro 180 185 190 gag aat agt gac att ctt gat ttg cta ctt gag gtg ctt cct ttg gaa 624 Glu Asn Ser Asp Ile Leu Asp Leu Leu Leu Glu Val Leu Pro Leu Glu 195 200 205 cag aat ccg gag gta cgc aaa aca att gtt tta tct ctg cct cct tca 672 Gln Asn Pro Glu Val Arg Lys Thr Ile Val Leu Ser Leu Pro Pro Ser 210 215 220 aat gca acc acc caa gca att att gat tgc acg ctc gat gtt aat gaa 720 Asn Ala Thr Thr Gln Ala Ile Ile Asp Cys Thr Leu Asp Val Asn Glu 225 230 235 240 tca gta cgc aaa gca gct tac tct gtt ctg gct aat aaa gtt cct ctt 768 Ser Val Arg Lys Ala Ala Tyr Ser Val Leu Ala Asn Lys Val Pro Leu 245 250 255 cag agt ctg 
agc atc aag cta agg acc aca att ctt cag aga ggg ctc 816 Gln Ser Leu Ser Ile Lys Leu Arg Thr Thr Ile Leu Gln Arg Gly Leu 260 265 270 gct gat cgt gct gta aat gtt tca acg gaa tgc ttg aag cta atg aaa 864 Ala Asp Arg Ala Val Asn Val Ser Thr Glu Cys Leu Lys Leu Met Lys 275 280 285 gag caa tgg cta gct aac tat tgt gaa gga gat cct atc aca ttt ctt 912 Glu Gln Trp Leu Ala Asn Tyr Cys Glu Gly Asp Pro Ile Thr Phe Leu 290 295 300 aaa tac ctt gac gtt gaa acc tac gaa tct gtg gca gaa tcg gcg ttg 960 Lys Tyr Leu Asp Val Glu Thr Tyr Glu Ser Val Ala Glu Ser Ala Leu 305 310 315 320 gaa gtt ctg ttg agt gaa ggg ttg ata atg ccc tcc gat gat aaa agc 1008 Glu Val Leu Leu Ser Glu Gly Leu Ile Met Pro Ser Asp Asp Lys Ser 325 330 335 atc cag cag tac ata ttg tca gca gat ggt gaa act aga gat gaa agc 1056 Ile Gln Gln Tyr Ile Leu Ser Ala Asp Gly Glu Thr Arg Asp Glu Ser 340 345 350 aca tgt tct gca ccc agc att caa ctt atg gag cca gag att gct ctt 1104 Thr Cys Ser Ala Pro Ser Ile Gln Leu Met Glu Pro Glu Ile Ala Leu 355 360 365 tac tgg agg att ata tgc agg aag gtg cac caa agt gct caa gca aaa 1152 Tyr Trp Arg Ile Ile Cys Arg Lys Val His Gln Ser Ala Gln Ala Lys 370 375 380 ggt tct gat gct gct aca gcg atg ggt gct gaa gca gca gtt tat gct 1200 Gly Ser Asp Ala Ala Thr Ala Met Gly Ala Glu Ala Ala Val Tyr Ala 385 390 395 400 gct gaa gct tca gat gct aat gat ttg ctg gaa aga att ctt cct gca 1248 Ala Glu Ala Ser Asp Ala Asn Asp Leu Leu Glu Arg Ile Leu Pro Ala 405 410 415 aca gtt tct gat tat gtt gat cta gtt aaa gct cat ata gaa gca gga 1296 Thr Val Ser Asp Tyr Val Asp Leu Val Lys Ala His Ile Glu Ala Gly 420 425 430 cca aat cat cac ttt gct tcg agg cag cta tta ttg ttg ggt acg atg 1344 Pro Asn His His Phe Ala Ser Arg Gln Leu Leu Leu Leu Gly Thr Met 435 440 445 ctt gat ttc tct gat gct atg ctc cac aag act gtg agc tca ttt gtc 1392 Leu Asp Phe Ser Asp Ala Met Leu His Lys Thr Val Ser Ser Phe Val 450 455 460 cag gag ctg ctc cgc aga cct ttt gag caa gaa tta gat gaa gat ggg 1440 Gln Glu Leu Leu Arg Arg Pro Phe Glu Gln Glu Leu Asp 
Glu Asp Gly 465 470 475 480 aac agt att gtg att gga gat ggt ata aac ctt ggt ggt gat aaa gat 1488 Asn Ser Ile Val Ile Gly Asp Gly Ile Asn Leu Gly Gly Asp Lys Asp 485 490 495 tgg gct gag gca gtg tcg aaa ttg gct aaa aaa gtc cat gct gcc cct 1536 Trp Ala Glu Ala Val Ser Lys Leu Ala Lys Lys Val His Ala Ala Pro 500 505 510 gga gaa tat gaa gaa gtt ata ctt gtt gtt gtt gaa gaa gta gcg aga 1584 Gly Glu Tyr Glu Glu Val Ile Leu Val Val Val Glu Glu Val Ala Arg 515 520 525 ccc tgc agg gaa agg act gcg gat ttc ctg cag tgg atg cac atg ctt 1632 Pro Cys Arg Glu Arg Thr Ala Asp Phe Leu Gln Trp Met His Met Leu 530 535 540 tct ctg aca agt ctt ctc ttg gaa aat gga aaa tct tta cat tca ctg 1680 Ser Leu Thr Ser Leu Leu Leu Glu Asn Gly Lys Ser Leu His Ser Leu 545 550 555 560 caa ggg aag gct att gaa cca gaa gag ata ttg cat gct cta ttg ctt 1728 Gln Gly Lys Ala Ile Glu Pro Glu Glu Ile Leu His Ala Leu Leu Leu 565 570 575 cca ggg gca aaa cac act cac ttg gat gtg cag agg att gct ata aag 1776 Pro Gly Ala Lys His Thr His Leu Asp Val Gln Arg Ile Ala Ile Lys 580 585 590 ggc ctt ggt ctt ttt ggt ttg tta gag aag aag cct agc gaa gag cta 1824 Gly Leu Gly Leu Phe Gly Leu Leu Glu Lys Lys Pro Ser Glu Glu Leu 595 600 605 gta agg cag cta cgt gca gct ttt tgc aga agc cct cct cca att agt 1872 Val Arg Gln Leu Arg Ala Ala Phe Cys Arg Ser Pro Pro Pro Ile Ser 610 615 620 att atg gct tgc aag gca cta gtg gat ctt ggg atg tgg cat agc cca 1920 Ile Met Ala Cys Lys Ala Leu Val Asp Leu Gly Met Trp His Ser Pro 625 630 635 640 act gaa gtt gac aag gca atg gga caa gat ctc ttg tca caa ttt gag 1968 Thr Glu Val Asp Lys Ala Met Gly Gln Asp Leu Leu Ser Gln Phe Glu 645 650 655 gat gac agc att gat ttt gca ccc atc gac tta tcc aat gcc gaa gaa 2016 Asp Asp Ser Ile Asp Phe Ala Pro Ile Asp Leu Ser Asn Ala Glu Glu 660 665 670 gat atg aac ttt aaa atg ctc gat ctc ttg tat gct gga ctt gaa agt 2064 Asp Met Asn Phe Lys Met Leu Asp Leu Leu Tyr Ala Gly Leu Glu Ser 675 680 685 gat gac tgg aga gca tcc aca gag agc agt gag aat gag tca gtt aaa 2112 Asp Asp 
Trp Arg Ala Ser Thr Glu Ser Ser Glu Asn Glu Ser Val Lys 690 695 700 gca act gtt ggg gag ggg ttt gca aaa ctt ctt ctt cta gga gag aag 2160 Ala Thr Val Gly Glu Gly Phe Ala Lys Leu Leu Leu Leu Gly Glu Lys 705 710 715 720 tac cca aac ttg cct gca tct ttc tat ccg ttt gta ttg gga aag cta 2208 Tyr Pro Asn Leu Pro Ala Ser Phe Tyr Pro Phe Val Leu Gly Lys Leu 725 730 735 att gct tta tat ttc agt gag gag tca aaa gaa caa ctg agg ttt aaa 2256 Ile Ala Leu Tyr Phe Ser Glu Glu Ser Lys Glu Gln Leu Arg Phe Lys 740 745 750 cag tgt tta tcc gtc ttc ttt gag cac tat gcc tcc ctc tca gaa aaa 2304 Gln Cys Leu Ser Val Phe Phe Glu His Tyr Ala Ser Leu Ser Glu Lys 755 760 765 cat aag gga tac gtg tca aag gct ttt gtt cct ctt gta cgc tca atg 2352 His Lys Gly Tyr Val Ser Lys Ala Phe Val Pro Leu Val Arg Ser Met 770 775 780 tgg cct ggg att gat gga aac act aaa agt tcg tca tat gtt gta tca 2400 Trp Pro Gly Ile Asp Gly Asn Thr Lys Ser Ser Ser Tyr Val Val Ser 785 790 795 800 aat caa cgc aag cgc gca gtc caa gta tcc aga ttc atc cta cag atg 2448 Asn Gln Arg Lys Arg Ala Val Gln Val Ser Arg Phe Ile Leu Gln Met 805 810 815 atg caa acc ccg cta tac aaa aaa gag aca aga ggt gag cct gaa agc 2496 Met Gln Thr Pro Leu Tyr Lys Lys Glu Thr Arg Gly Glu Pro Glu Ser 820 825 830 cag gtc aat aaa tca cca gaa gat tct atc caa cat cca cta aac tgt 2544 Gln Val Asn Lys Ser Pro Glu Asp Ser Ile Gln His Pro Leu Asn Cys 835 840 845 aca gaa gaa ggt ttg gcc ata cgc ata gcc atc gag atg cta agc ttc 2592 Thr Glu Glu Gly Leu Ala Ile Arg Ile Ala Ile Glu Met Leu Ser Phe 850 855 860 aaa gaa aag aag act gct cat gag aag gca tat gtt gca gcg ctg tgc 2640 Lys Glu Lys Lys Thr Ala His Glu Lys Ala Tyr Val Ala Ala Leu Cys 865 870 875 880 aaa ata ctt gtg ctg ctc cat ctg aaa cca tca gaa caa aat gtg aca 2688 Lys Ile Leu Val Leu Leu His Leu Lys Pro Ser Glu Gln Asn Val Thr 885 890 895 aag ttg ttg aaa aag ctt tta agc cta ctc gca gat tct gta cgt tca 2736 Lys Leu Leu Lys Lys Leu Leu Ser Leu Leu Ala Asp Ser Val Arg Ser 900 905 910 gag aag gat ctt ctt aaa gaa gtg 
aaa ccg gtg ctt caa cat cta aaa 2784 Glu Lys Asp Leu Leu Lys Glu Val Lys Pro Val Leu Gln His Leu Lys 915 920 925 tct ttg gat gct tgt cca agc gag gag ctt aca caa gat caa gca aat 2832 Ser Leu Asp Ala Cys Pro Ser Glu Glu Leu Thr Gln Asp Gln Ala Asn 930 935 940 tcc atc ttt gaa ata ctc gga gtt agc tac aac tta gag atc act gaa 2880 Ser Ile Phe Glu Ile Leu Gly Val Ser Tyr Asn Leu Glu Ile Thr Glu 945 950 955 960 act aca aca gtg cca cag aca cct gct cct tgc tca aca aaa cca gct 2928 Thr Thr Thr Val Pro Gln Thr Pro Ala Pro Cys Ser Thr Lys Pro Ala 965 970 975 cgg tca aga aga cga gca agg att gaa gaa aca tcc tct gat gaa gaa 2976 Arg Ser Arg Arg Arg Ala Arg Ile Glu Glu Thr Ser Ser Asp Glu Glu 980 985 990 gaa gta gca tct cct cca cct tct gct cct aat act ttg atg acc cga 3024 Glu Val Ala Ser Pro Pro Pro Ser Ala Pro Asn Thr Leu Met Thr Arg 995 1000 1005 tca cac cgt gca agt aaa gct gct gcc ttg gcc aaa ata atg gca 3069 Ser His Arg Ala Ser Lys Ala Ala Ala Leu Ala Lys Ile Met Ala 1010 1015 1020 agc aaa gtg aaa atg agc aat gta gac gag gat gat gag gaa gaa 3114 Ser Lys Val Lys Met Ser Asn Val Asp Glu Asp Asp Glu Glu Glu 1025 1030 1035 gaa ggt tct tct gat gtt aca gca gat gat tca gat gaa taa 3156 Glu Gly Ser Ser Asp Val Thr Ala Asp Asp Ser Asp Glu 1040 1045 1050 46 1051 PRT Arabidopsis thaliana 46 Met Gly Glu Glu Ser Glu Ile Ala Met Ala Ser Ala Asp Arg Asn Asn 1 5 10 15 Leu Thr Gln Lys Ile Ala Lys Ile Leu Asn Glu Thr Arg Thr Ser Tyr 20 25 30 Ala Thr His Asn Arg Lys Leu Lys Glu Leu Ala Thr Ile Arg Ser Lys 35 40 45 Leu Ser Ser Ser Glu Ser Glu Ser Glu Ser Val Ser Ser Ser Ile Leu 50 55 60 Gln Phe Ser Ser Val Phe Leu Lys Thr Leu Thr Pro Leu Phe Ile Ala 65 70 75 80 Ala Gln Arg Arg Thr Ala Ala Ala Glu Arg Val Val Arg Phe Val Ala 85 90 95 Glu Phe Ala Cys Leu Arg Ser Asn Ser Asp Gly Asp Ser Asp Cys Asp 100 105 110 Glu Phe Leu Glu Glu Phe Leu Lys Phe Leu Val Ala Gly Ser Val Ala 115 120 125 Ala Asn Arg Asn Ala Arg Phe Arg Ala Cys Gln Ile Ile Ser Glu Ile 130 135 140 Ile Leu Arg Leu Pro Asp Glu Val Glu Val 
Ala Asp Glu Leu Trp Asp 145 150 155 160 Asp Val Ile Asp Cys Met Met Leu Arg Val Arg Asp Lys Val Pro Val 165 170 175 Ile Arg Thr Phe Ala Val Arg Ser Leu Ser Arg Phe Val Asn Asp Pro 180 185 190 Glu Asn Ser Asp Ile Leu Asp Leu Leu Leu Glu Val Leu Pro Leu Glu 195 200 205 Gln Asn Pro Glu Val Arg Lys Thr Ile Val Leu Ser Leu Pro Pro Ser 210 215 220 Asn Ala Thr Thr Gln Ala Ile Ile Asp Cys Thr Leu Asp Val Asn Glu 225 230 235 240 Ser Val Arg Lys Ala Ala Tyr Ser Val Leu Ala Asn Lys Val Pro Leu 245 250 255 Gln Ser Leu Ser Ile Lys Leu Arg Thr Thr Ile Leu Gln Arg Gly Leu 260 265 270 Ala Asp Arg Ala Val Asn Val Ser Thr Glu Cys Leu Lys Leu Met Lys 275 280 285 Glu Gln Trp Leu Ala Asn Tyr Cys Glu Gly Asp Pro Ile Thr Phe Leu 290 295 300 Lys Tyr Leu Asp Val Glu Thr Tyr Glu Ser Val Ala Glu Ser Ala Leu 305 310 315 320 Glu Val Leu Leu Ser Glu Gly Leu Ile Met Pro Ser Asp Asp Lys Ser 325 330 335 Ile Gln Gln Tyr Ile Leu Ser Ala Asp Gly Glu Thr Arg Asp Glu Ser 340 345 350 Thr Cys Ser Ala Pro Ser Ile Gln Leu Met Glu Pro Glu Ile Ala Leu 355 360 365 Tyr Trp Arg Ile Ile Cys Arg Lys Val His Gln Ser Ala Gln Ala Lys 370 375 380 Gly Ser Asp Ala Ala Thr Ala Met Gly Ala Glu Ala Ala Val Tyr Ala 385 390 395 400 Ala Glu Ala Ser Asp Ala Asn Asp Leu Leu Glu Arg Ile Leu Pro Ala 405 410 415 Thr Val Ser Asp Tyr Val Asp Leu Val Lys Ala His Ile Glu Ala Gly 420 425 430 Pro Asn His His Phe Ala Ser Arg Gln Leu Leu Leu Leu Gly Thr Met 435 440 445 Leu Asp Phe Ser Asp Ala Met Leu His Lys Thr Val Ser Ser Phe Val 450 455 460 Gln Glu Leu Leu Arg Arg Pro Phe Glu Gln Glu Leu Asp Glu Asp Gly 465 470 475 480 Asn Ser Ile Val Ile Gly Asp Gly Ile Asn Leu Gly Gly Asp Lys Asp 485 490 495 Trp Ala Glu Ala Val Ser Lys Leu Ala Lys Lys Val His Ala Ala Pro 500 505 510 Gly Glu Tyr Glu Glu Val Ile Leu Val Val Val Glu Glu Val Ala Arg 515 520 525 Pro Cys Arg Glu Arg Thr Ala Asp Phe Leu Gln Trp Met His Met Leu 530 535 540 Ser Leu Thr Ser Leu Leu Leu Glu Asn Gly Lys Ser Leu His Ser Leu 545 550 555 560 Gln Gly Lys Ala Ile Glu Pro Glu Glu Ile 
Leu His Ala Leu Leu Leu 565 570 575 Pro Gly Ala Lys His Thr His Leu Asp Val Gln Arg Ile Ala Ile Lys 580 585 590 Gly Leu Gly Leu Phe Gly Leu Leu Glu Lys Lys Pro Ser Glu Glu Leu 595 600 605 Val Arg Gln Leu Arg Ala Ala Phe Cys Arg Ser Pro Pro Pro Ile Ser 610 615 620 Ile Met Ala Cys Lys Ala Leu Val Asp Leu Gly Met Trp His Ser Pro 625 630 635 640 Thr Glu Val Asp Lys Ala Met Gly Gln Asp Leu Leu Ser Gln Phe Glu 645 650 655 Asp Asp Ser Ile Asp Phe Ala Pro Ile Asp Leu Ser Asn Ala Glu Glu 660 665 670 Asp Met Asn Phe Lys Met Leu Asp Leu Leu Tyr Ala Gly Leu Glu Ser 675 680 685 Asp Asp Trp Arg Ala Ser Thr Glu Ser Ser Glu Asn Glu Ser Val Lys 690 695 700 Ala Thr Val Gly Glu Gly Phe Ala Lys Leu Leu Leu Leu Gly Glu Lys 705 710 715 720 Tyr Pro Asn Leu Pro Ala Ser Phe Tyr Pro Phe Val Leu Gly Lys Leu 725 730 735 Ile Ala Leu Tyr Phe Ser Glu Glu Ser Lys Glu Gln Leu Arg Phe Lys 740 745 750 Gln Cys Leu Ser Val Phe Phe Glu His Tyr Ala Ser Leu Ser Glu Lys 755 760 765 His Lys Gly Tyr Val Ser Lys Ala Phe Val Pro Leu Val Arg Ser Met 770 775 780 Trp Pro Gly Ile Asp Gly Asn Thr Lys Ser Ser Ser Tyr Val Val Ser 785 790 795 800 Asn Gln Arg Lys Arg Ala Val Gln Val Ser Arg Phe Ile Leu Gln Met 805 810 815 Met Gln Thr Pro Leu Tyr Lys Lys Glu Thr Arg Gly Glu Pro Glu Ser 820 825 830 Gln Val Asn Lys Ser Pro Glu Asp Ser Ile Gln His Pro Leu Asn Cys 835 840 845 Thr Glu Glu Gly Leu Ala Ile Arg Ile Ala Ile Glu Met Leu Ser Phe 850 855 860 Lys Glu Lys Lys Thr Ala His Glu Lys Ala Tyr Val Ala Ala Leu Cys 865 870 875 880 Lys Ile Leu Val Leu Leu His Leu Lys Pro Ser Glu Gln Asn Val Thr 885 890 895 Lys Leu Leu Lys Lys Leu Leu Ser Leu Leu Ala Asp Ser Val Arg Ser 900 905 910 Glu Lys Asp Leu Leu Lys Glu Val Lys Pro Val Leu Gln His Leu Lys 915 920 925 Ser Leu Asp Ala Cys Pro Ser Glu Glu Leu Thr Gln Asp Gln Ala Asn 930 935 940 Ser Ile Phe Glu Ile Leu Gly Val Ser Tyr Asn Leu Glu Ile Thr Glu 945 950 955 960 Thr Thr Thr Val Pro Gln Thr Pro Ala Pro Cys Ser Thr Lys Pro Ala 965 970 975 Arg Ser Arg Arg Arg Ala Arg Ile Glu Glu Thr 
Ser Ser Asp Glu Glu 980 985 990 Glu Val Ala Ser Pro Pro Pro Ser Ala Pro Asn Thr Leu Met Thr Arg 995 1000 1005 Ser His Arg Ala Ser Lys Ala Ala Ala Leu Ala Lys Ile Met Ala 1010 1015 1020 Ser Lys Val Lys Met Ser Asn Val Asp Glu Asp Asp Glu Glu Glu 1025 1030 1035 Glu Gly Ser Ser Asp Val Thr Ala Asp Asp Ser Asp Glu 1040 1045 1050 47 1695 DNA Arabidopsis thaliana CDS (1)..(1695) 47 atg gtg ttc tct aac tct ctc ctc tct tcg acc act tcg ctt acc aca 48 Met Val Phe Ser Asn Ser Leu Leu Ser Ser Thr Thr Ser Leu Thr Thr 1 5 10 15 tgg gca aat caa tca agt tct cag tca agt ctt tcc ccg aga tat tcg 96 Trp Ala Asn Gln Ser Ser Ser Gln Ser Ser Leu Ser Pro Arg Tyr Ser 20 25 30 aca tgg cag tgt gtt tgt tta cgt aat cag aaa cgc aaa cct aag ctc 144 Thr Trp Gln Cys Val Cys Leu Arg Asn Gln Lys Arg Lys Pro Lys Leu 35 40 45 tac ttg att ccc gct cgc cat ttt tta tcg act cca att gat tca gtc 192 Tyr Leu Ile Pro Ala Arg His Phe Leu Ser Thr Pro Ile Asp Ser Val 50 55 60 tcc tcc tcc tca ata acg gct tca cgt tat gct act tct ggc gtt tct 240 Ser Ser Ser Ser Ile Thr Ala Ser Arg Tyr Ala Thr Ser Gly Val Ser 65 70 75 80 gaa gtt caa agg tct act tcg agc aac aat gtt act gag atg gaa gag 288 Glu Val Gln Arg Ser Thr Ser Ser Asn Asn Val Thr Glu Met Glu Glu 85 90 95 ttt gag atg gag ttg caa gag ttg ttt aat gaa gtc aaa agt atg gtg 336 Phe Glu Met Glu Leu Gln Glu Leu Phe Asn Glu Val Lys Ser Met Val 100 105 110 aag att ggg aag gaa agt gac gca atg gac ctt ctt cga gcg aat tat 384 Lys Ile Gly Lys Glu Ser Asp Ala Met Asp Leu Leu Arg Ala Asn Tyr 115 120 125 gtt gct gtg aaa gaa gag ctg gat tcg ggt ttg aaa ggt atc gaa caa 432 Val Ala Val Lys Glu Glu Leu Asp Ser Gly Leu Lys Gly Ile Glu Gln 130 135 140 gct gct gtt ctt gac atc att gct ttg ggg tat atg gct gtt gga gac 480 Ala Ala Val Leu Asp Ile Ile Ala Leu Gly Tyr Met Ala Val Gly Asp 145 150 155 160 ttg aaa cct gtc ccc gca ttg ctt gat atg ata aac aag att gtt gat 528 Leu Lys Pro Val Pro Ala Leu Leu Asp Met Ile Asn Lys Ile Val Asp 165 170 175 aat tta aaa gac tcc gaa cct ctt ttg gat tca gta 
ctt atg cat gtt 576 Asn Leu Lys Asp Ser Glu Pro Leu Leu Asp Ser Val Leu Met His Val 180 185 190 ggc agt atg tat tcg gta ata ggg aag ttt gaa aac gct ata ctc gtg 624 Gly Ser Met Tyr Ser Val Ile Gly Lys Phe Glu Asn Ala Ile Leu Val 195 200 205 cat caa agg gct att cgt ata tta gag aat aga tat ggt aaa tgc aat 672 His Gln Arg Ala Ile Arg Ile Leu Glu Asn Arg Tyr Gly Lys Cys Asn 210 215 220 act tta ctt gtc aca cca tta ctt ggt atg gct aag agt ttt gct tct 720 Thr Leu Leu Val Thr Pro Leu Leu Gly Met Ala Lys Ser Phe Ala Ser 225 230 235 240 gat gga aaa gcc act aaa gcg att ggt gtt tat gag cgt acg ttg act 768 Asp Gly Lys Ala Thr Lys Ala Ile Gly Val Tyr Glu Arg Thr Leu Thr 245 250 255 atc tta gaa cgg aat aga ggt tct gag agt gag gat cta gta gtt ccg 816 Ile Leu Glu Arg Asn Arg Gly Ser Glu Ser Glu Asp Leu Val Val Pro 260 265 270 tta ttt tca ctc ggt aaa ctt ctg ctc aaa gaa ggt aaa gct gct gaa 864 Leu Phe Ser Leu Gly Lys Leu Leu Leu Lys Glu Gly Lys Ala Ala Glu 275 280 285 gca gaa att cct ttt acc agc att gta aat ata tac aag aag ata tac 912 Ala Glu Ile Pro Phe Thr Ser Ile Val Asn Ile Tyr Lys Lys Ile Tyr 290 295 300 gga gag aga gat gga aga gtt ggc atg gcc atg tgt tcc ctt gct aat 960 Gly Glu Arg Asp Gly Arg Val Gly Met Ala Met Cys Ser Leu Ala Asn 305 310 315 320 gct aag tgc tca aaa ggt gat gca aat gaa gct gtt gat atc tac agg 1008 Ala Lys Cys Ser Lys Gly Asp Ala Asn Glu Ala Val Asp Ile Tyr Arg 325 330 335 aat gct cta cga ata atc aaa gac tca aat tat atg aca ata gac aac 1056 Asn Ala Leu Arg Ile Ile Lys Asp Ser Asn Tyr Met Thr Ile Asp Asn 340 345 350 agt atc ttg gaa aac atg agg ata gat ctt gct gag ctg ctt cat ttc 1104 Ser Ile Leu Glu Asn Met Arg Ile Asp Leu Ala Glu Leu Leu His Phe 355 360 365 gtt gga agg gga gat gaa gga cga gag tta cta gaa gaa tgc tta tta 1152 Val Gly Arg Gly Asp Glu Gly Arg Glu Leu Leu Glu Glu Cys Leu Leu 370 375 380 ata aat gag aga ttc aaa ggg aaa aac cat cct agc atg gct aca cat 1200 Ile Asn Glu Arg Phe Lys Gly Lys Asn His Pro Ser Met Ala Thr His 385 390 395 400 ctt ata aac 
ctt gca gca tct tat tcg cgt tcc aag aat tat gtg gag 1248 Leu Ile Asn Leu Ala Ala Ser Tyr Ser Arg Ser Lys Asn Tyr Val Glu 405 410 415 gca gaa cga ttg cta cga act tgt ttg aac atc atg gag gta tca gtt 1296 Ala Glu Arg Leu Leu Arg Thr Cys Leu Asn Ile Met Glu Val Ser Val 420 425 430 ggc tca gaa ggt cag tcc ata acc ttc cca atg cta aat ctt gca gtc 1344 Gly Ser Glu Gly Gln Ser Ile Thr Phe Pro Met Leu Asn Leu Ala Val 435 440 445 act ctt tcc caa ctg aat cgt gat gag gaa gcc gag caa ata gcc tta 1392 Thr Leu Ser Gln Leu Asn Arg Asp Glu Glu Ala Glu Gln Ile Ala Leu 450 455 460 aag gtt cta cgc atc cgt gag aag gca ttt ggc gaa gac tct ctc cct 1440 Lys Val Leu Arg Ile Arg Glu Lys Ala Phe Gly Glu Asp Ser Leu Pro 465 470 475 480 gtt gga gag gcg ctg gac tgt ttg gta tcg atc caa gcg aga cta gga 1488 Val Gly Glu Ala Leu Asp Cys Leu Val Ser Ile Gln Ala Arg Leu Gly 485 490 495 aga gat gat gga gaa ata ttg ggt ttg ctg aaa agg gtt atg atg atc 1536 Arg Asp Asp Gly Glu Ile Leu Gly Leu Leu Lys Arg Val Met Met Ile 500 505 510 caa gag aaa gag ttt ggt cct tca gct caa gaa ctc att gtt act ctc 1584 Gln Glu Lys Glu Phe Gly Pro Ser Ala Gln Glu Leu Ile Val Thr Leu 515 520 525 cag aag atc att cat ttc ttg gag aaa ttg gag atg aaa gat gat aaa 1632 Gln Lys Ile Ile His Phe Leu Glu Lys Leu Glu Met Lys Asp Asp Lys 530 535 540 ttt aag ttt agg aga aga tta gct tta ctt aga gag aga tac aag cag 1680 Phe Lys Phe Arg Arg Arg Leu Ala Leu Leu Arg Glu Arg Tyr Lys Gln 545 550 555 560 agt ctt agc tac tag 1695 Ser Leu Ser Tyr 48 564 PRT Arabidopsis thaliana 48 Met Val Phe Ser Asn Ser Leu Leu Ser Ser Thr Thr Ser Leu Thr Thr 1 5 10 15 Trp Ala Asn Gln Ser Ser Ser Gln Ser Ser Leu Ser Pro Arg Tyr Ser 20 25 30 Thr Trp Gln Cys Val Cys Leu Arg Asn Gln Lys Arg Lys Pro Lys Leu 35 40 45 Tyr Leu Ile Pro Ala Arg His Phe Leu Ser Thr Pro Ile Asp Ser Val 50 55 60 Ser Ser Ser Ser Ile Thr Ala Ser Arg Tyr Ala Thr Ser Gly Val Ser 65 70 75 80 Glu Val Gln Arg Ser Thr Ser Ser Asn Asn Val Thr Glu Met Glu Glu 85 90 95 Phe Glu Met Glu Leu Gln Glu Leu Phe 
Asn Glu Val Lys Ser Met Val 100 105 110 Lys Ile Gly Lys Glu Ser Asp Ala Met Asp Leu Leu Arg Ala Asn Tyr 115 120 125 Val Ala Val Lys Glu Glu Leu Asp Ser Gly Leu Lys Gly Ile Glu Gln 130 135 140 Ala Ala Val Leu Asp Ile Ile Ala Leu Gly Tyr Met Ala Val Gly Asp 145 150 155 160 Leu Lys Pro Val Pro Ala Leu Leu Asp Met Ile Asn Lys Ile Val Asp 165 170 175 Asn Leu Lys Asp Ser Glu Pro Leu Leu Asp Ser Val Leu Met His Val 180 185 190 Gly Ser Met Tyr Ser Val Ile Gly Lys Phe Glu Asn Ala Ile Leu Val 195 200 205 His Gln Arg Ala Ile Arg Ile Leu Glu Asn Arg Tyr Gly Lys Cys Asn 210 215 220 Thr Leu Leu Val Thr Pro Leu Leu Gly Met Ala Lys Ser Phe Ala Ser 225 230 235 240 Asp Gly Lys Ala Thr Lys Ala Ile Gly Val Tyr Glu Arg Thr Leu Thr 245 250 255 Ile Leu Glu Arg Asn Arg Gly Ser Glu Ser Glu Asp Leu Val Val Pro 260 265 270 Leu Phe Ser Leu Gly Lys Leu Leu Leu Lys Glu Gly Lys Ala Ala Glu 275 280 285 Ala Glu Ile Pro Phe Thr Ser Ile Val Asn Ile Tyr Lys Lys Ile Tyr 290 295 300 Gly Glu Arg Asp Gly Arg Val Gly Met Ala Met Cys Ser Leu Ala Asn 305 310 315 320 Ala Lys Cys Ser Lys Gly Asp Ala Asn Glu Ala Val Asp Ile Tyr Arg 325 330 335 Asn Ala Leu Arg Ile Ile Lys Asp Ser Asn Tyr Met Thr Ile Asp Asn 340 345 350 Ser Ile Leu Glu Asn Met Arg Ile Asp Leu Ala Glu Leu Leu His Phe 355 360 365 Val Gly Arg Gly Asp Glu Gly Arg Glu Leu Leu Glu Glu Cys Leu Leu 370 375 380 Ile Asn Glu Arg Phe Lys Gly Lys Asn His Pro Ser Met Ala Thr His 385 390 395 400 Leu Ile Asn Leu Ala Ala Ser Tyr Ser Arg Ser Lys Asn Tyr Val Glu 405 410 415 Ala Glu Arg Leu Leu Arg Thr Cys Leu Asn Ile Met Glu Val Ser Val 420 425 430 Gly Ser Glu Gly Gln Ser Ile Thr Phe Pro Met Leu Asn Leu Ala Val 435 440 445 Thr Leu Ser Gln Leu Asn Arg Asp Glu Glu Ala Glu Gln Ile Ala Leu 450 455 460 Lys Val Leu Arg Ile Arg Glu Lys Ala Phe Gly Glu Asp Ser Leu Pro 465 470 475 480 Val Gly Glu Ala Leu Asp Cys Leu Val Ser Ile Gln Ala Arg Leu Gly 485 490 495 Arg Asp Asp Gly Glu Ile Leu Gly Leu Leu Lys Arg Val Met Met Ile 500 505 510 Gln Glu Lys Glu Phe Gly Pro Ser Ala Gln 
Glu Leu Ile Val Thr Leu 515 520 525 Gln Lys Ile Ile His Phe Leu Glu Lys Leu Glu Met Lys Asp Asp Lys 530 535 540 Phe Lys Phe Arg Arg Arg Leu Ala Leu Leu Arg Glu Arg Tyr Lys Gln 545 550 555 560 Ser Leu Ser Tyr
49 18 DNA artificial sequence primer 49 ngttgwgnat wtsgwgnt 18
50 16 DNA artificial sequence primer 50 ngtcgaswga nawgaa 16
51 16 DNA artificial sequence primer 51 tgwgnagsan casaga 16
52 16 DNA artificial sequence misc_feature (1)..(16) 52 agwgnagwan cawagg 16
53 16 DNA artificial sequence misc_feature (1)..(16) 53 sttgntastn ctntgc 16
54 15 DNA artificial sequence misc_feature (1)..(15) 54 ntcgastwts gwgtt 15
55 16 DNA artificial sequence primer 55 wgtgnagwan canaga 16
56 30 DNA artificial sequence primer 56 actagctcta ccgtttccgt ttccgtttac 30
57 30 DNA artificial sequence primer 57 ttacctcggg ttcgaaatcg atcgggataa 30
58 33 DNA artificial sequence primer 58 aaaatcggtt atacgataac ggtcggtacg gga 33
59 36 DNA artificial sequence primer 59 gggtcttgcg gatctgaata tatgttttca tgtgtg 36
60 37 DNA artificial sequence misc_feature (1)..(37) 60 taccgaagaa aaataccggt tcccgtccga tttcgac 37
61 34 DNA artificial sequence primer 61 ggatcgtatc ggttttcgat taccgtattt atcc 34
62 29 DNA artificial sequence primer 62 attaggcacc ccaggcttta cactttatg 29
63 30 DNA artificial sequence primer 63 gtatgttgtg tggaattgtg agcggataac 30
64 30 DNA artificial sequence primer 64 taacaatttc acacaggaaa cagctatgac 30
65 34 DNA artificial sequence primer 65 tagcatctga atttcataac caatctcgat acac 34
66 34 DNA artificial sequence primer 66 gcttcctatt atatcttccc aaattaccaa taca 34
67 34 DNA artificial sequence primer 67 gccttttcag aaatggataa atagccttgc ttcc 34
68 32 DNA artificial sequence primer 68 caaactagga taaattatcg cgcgcggtgt ca 32
69 32 DNA artificial sequence primer 69 ggtgtcatct atgttactag atcgggaatt ga 32
70 30 DNA artificial sequence primer 70 cgccatggca tatgctagca tgcataattc 30
71 35 DNA artificial sequence primer 71 aattggtaat tactctttct tttcctccat attga 35
72 31 DNA artificial sequence primer 72 atattgacca tcatactcat tgctgatcca t 31
73 28 DNA artificial sequence primer 73 tgatccatgt agatttcccg gacatgaa 28
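Several of the primer entries in the listing above (for example SEQ ID NOs:49-55) contain symbols other than a, c, g and t, such as n, w and s. Under the standard IUPAC nucleotide code these denote mixtures of bases, so each such entry stands for a pool of concrete oligonucleotides. The following Python sketch is offered only as an illustration and is not part of the original disclosure: it counts and enumerates the concrete sequences represented by a degenerate primer, and checks that a CDS length is consistent with the length of the encoded protein entry. The function names `degeneracy`, `expand` and `encoded_residues` are hypothetical, and the sketch assumes that the listing uses standard IUPAC symbols and that each CDS ends with a single stop codon.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (lower case, as in the listing).
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at",
    "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg",
    "n": "acgt",
}


def _clean(primer):
    """Remove the blanks that group bases in blocks of ten in the listing."""
    return primer.replace(" ", "").lower()


def degeneracy(primer):
    """Number of concrete sequences represented by a degenerate primer."""
    count = 1
    for base in _clean(primer):
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in _clean(primer)]
    return ["".join(combo) for combo in product(*choices)]


def encoded_residues(cds_length_nt):
    """Residue count for a CDS of the given length.

    Assumption (hypothetical helper): the CDS is a whole number of codons
    and its final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(cds_length_nt, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1


if __name__ == "__main__":
    # SEQ ID NO:50 as printed in the listing.
    print(degeneracy("ngtcgaswga nawgaa"))
    # CDS lengths of SEQ ID NOs:43, 45 and 47 as printed in the listing.
    for length in (2451, 3156, 1695):
        print(length, encoded_residues(length))
```

Used this way, the residue counts derived from the CDS lengths of SEQ ID NOs:43, 45 and 47 can be compared against the lengths stated for the corresponding protein entries, SEQ ID NOs:44, 46 and 48.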

What is claimed is:
 1. A method of identifying a herbicidal compound, comprising: a) combining a polypeptide comprising an amino acid sequence at least 90% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48 with a compound to be tested for the ability to bind to said polypeptide, under conditions conducive to binding; b) selecting a compound identified in (a) that binds to said polypeptide; c) applying a compound selected in (b) to a plant to test for herbicidal activity; and d) selecting a compound identified in (c) that has herbicidal activity.
 2. The method according to claim 1, wherein said polypeptide comprises an amino acid sequence at least 95% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.
 3. The method according to claim 2, wherein said polypeptide comprises an amino acid sequence at least 99% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.
 4. The method according to claim 3, wherein said polypeptide comprises an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.
 5. A method of identifying a herbicidal compound, comprising: a) combining a polypeptide comprising an amino acid sequence at least 90% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48 with a compound to be tested for the ability to inhibit the activity of said polypeptide, under conditions conducive to inhibition; b) selecting a compound identified in (a) that inhibits the activity of said polypeptide; c) applying a compound selected in (b) to a plant to test for herbicidal activity; and d) selecting a compound identified in (c) that has herbicidal activity.
 6. The method according to claim 5, wherein said polypeptide comprises an amino acid sequence at least 95% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.
 7. The method according to claim 6, wherein said polypeptide comprises an amino acid sequence at least 99% identical to an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.
 8. The method according to claim 7, wherein said polypeptide comprises an amino acid sequence selected from the group consisting of the even numbered SEQ ID NOs:2-48.
 9. A method for killing or inhibiting the growth or viability of a plant, comprising applying to the plant a herbicidal compound identified according to the method of claim 1.
 10. A method for killing or inhibiting the growth or viability of a plant, comprising applying to the plant a herbicidal compound identified according to the method of claim 5.
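
By way of illustration only, the percent identity thresholds recited in claims 1-8 (at least 90%, 95%, or 99% identical, or identical) can be checked with a short calculation. The following Python sketch is not part of the claimed subject matter and does not define how identity is determined for purposes of the claims. It assumes that the candidate and reference sequences have already been aligned by some external tool, that gaps are written as "-", and that identity is counted over the positions of the reference sequence. The function names `percent_identity` and `highest_threshold_met` and the short example sequences are hypothetical and do not correspond to any SEQ ID NO.

```python
def percent_identity(aligned_query: str, aligned_ref: str) -> float:
    """Percent identity of a query to a reference, given a pre-computed alignment.

    Assumption: both strings come from the same pairwise alignment, so they
    have equal length, and '-' marks a gap. Identity is counted over the
    reference positions only (columns where the reference has a residue).
    """
    if len(aligned_query) != len(aligned_ref):
        raise ValueError("aligned sequences must have the same length")

    ref_positions = 0
    matches = 0
    for q, r in zip(aligned_query.upper(), aligned_ref.upper()):
        if r == "-":
            # Insertion in the query relative to the reference: not counted.
            continue
        ref_positions += 1
        if q == r:
            matches += 1

    if ref_positions == 0:
        raise ValueError("reference contains no residues")
    return 100.0 * matches / ref_positions


def highest_threshold_met(identity: float):
    """Return the highest of the 99/95/90 percent thresholds met, or None."""
    for threshold in (99.0, 95.0, 90.0):
        if identity >= threshold:
            return threshold
    return None


# Hypothetical ten-residue example: one substitution at the last position.
reference = "MKTAYIAKQR"
candidate = "MKTAYIAKQK"
identity = percent_identity(candidate, reference)
print(identity, highest_threshold_met(identity))
```

Other conventions, such as dividing by the full alignment length or by the length of the shorter sequence, give different values for the same pair of sequences, so the convention used should be stated whenever an identity figure is reported.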